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Preface 



Colin P. Williams* 

Jet Propulsion Laboratory, California Institute of Technology, 

4800 Oak Grove Drive, Pasadena, CA 91109-8099, 
email: Colin.P.Williams@jpl.nasa.gov 

Over the past half century computers have gone from being the room-sized 
servants of a privileged few to the totable companions of business travellers, 
school children, and just about anyone who can point and click a mouse. In part, 
this transformation was made possible by the dramatic miniaturization in the 
basic components of a computer. This trend was quantified in 1964 by Gordon 
Moore, one of the founders of Intel, who noticed that the amount of information 
that could be stored on a given amount of silicon doubled roughly every 18 
months. The doubling trend continues to this day and, by crude extrapolation, 
predicts that the computers of 2020 might be approaching the one-atom-per-bit 
level. 

Physical systems such as atoms, however, behave in ways that are very dif- 
ferent from everyday objects. In fact they are governed by the laws of quantum 
mechanics rather than classical mechanics. In the early 1980’s some foresighted 
physicists, such es Charles Bennett (our conference Chairperson), Rolf Landauer, 
Paul Benioff, David Deutsch, and Richard Feynman, began to question what it 
would mean for a computer to operate at the one-atom-per-bit scale. The el- 
ementary operations of such a computer would need to be described in terms 
of quantum mechanics. Recently, physicists and computer scientists have come 
to appreciate that certain quantum effects, in particular superposition, inter- 
ference, entanglement, non-locality, indeterminism, and non-clonability, allow 
entirely new kinds of tasks to be performed. These tasks include teleporting 
quantum information, establishing shared secret cryptographic keys, searching 
unstructured “virtual” databases, factoring composite integers, simulating phys- 
ical systems, and enhancing the capacity of classical communication channels. 

In an effort to elucidate new ideas, and to push the envelope on existing 
ones, in 1998 my colleagues, Leon Alkalai, Richard Doyle, Amir Fijany, Sandeep 
Gulati, Benny Toomarian, Michail Zak, and I at the Jet Propulsion Laboratory, 
organized the “First NASA International Conference on Quantum Computing 
and Quantum Communications”, in Palm Springs, California. We imposed an 
a priori structure of the meeting that was designed to solicit contributions on 
how quantum computing might impact NASA mission objectives in computation 
and communications. I am delighted to say that NASA obtained great value for 
money from this conference. The papers contained in this volume are a testament 
to the rich diversity of ideas that were presented. 

* Supported by the NASA/JPL Center for Integrated Space Microsystems, NASA 
Advanced Concepts Office and the NASA Information and Computing Research 
Technologies Program 
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NASA’s interest in quantum computing and quantum communications stems 
from its need to solve daunting computational and communications problems. 
In particular, spacecraft design, mission planning, observation scheduling, design 
optimization, image processing, data assimilation, robotic vision, all impose ex- 
treme demands on computational resources. Given that quantum computers are 
known to speed up the solution of some computational problems and facilitate 
more efficient communications, we’d like to circumscribe their capabilities on 
the computational and communications problems of interest to NASA. In this 
regard, the Palm Springs conference was a tremendous success with several new 
ideas emerging for tackling structured search problems, Earth-to-space quantum 
key distribution, improved precision atomic clocks, and quantum gyroscopy. 

The papers appearing in this volume are organized by the following five 
themes: Entanglement and Quantum Algorithms, Quantum Cryptography, Quan- 
tum Copying and Quantum Information Theory, Quantum Error Correction and 
Fault- Tolerant Quantum Computing, and Embodiments of Quantum Computers. 
With such a diverse range of contributions we hope that there will be something 
in this volume to interest everyone. 

I would like to thank the program committee and reviewers who all con- 
tributed greatly to the success of the NASA QCQC’98 conference. I pay special 
thanks to Dr. Charles Bennett of IBM for serving as the conference Chairper- 
son. Finally, I would like to thank our NASA/JPL sponsors who supported 
this conference both financially and intellectually. In particular, I thank the 
NASA/JPL Center for Integrated Space Microsystems (CISM), the Ballistic 
Missile Defense Organization, the NASA/JPL Center for Space Microelectron- 
ics Technology (CSMT), and the NASA Autonomy and Information Technology 
Program Office. 
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Multi-particle Entanglement via Two-Particle 

Entanglement 



Gilles Brassard* and Tal Mor** 

Universite de Montreal, Departement IRO 
C.P. 6128, succursale centre-ville, Montreal (Quebec), Canada H3C 3J7 
{brassard, mor}@iro .umontreal . ca 



Abstract. Entanglement between n particles is a generalization of the 
entanglement between two particles, and a state is considered entan- 
gled if it cannot be written as a mixture of tensor products of the n 
particles’ states. We present the key notion of semi-separability, used to 
investigate n-particle entanglement by looking at two-subsystem entan- 
glement between its various subsystems. We provide necessary conditions 
for n-particle separability (that is, sufficient conditions for n-particle 
entanglement). We also provide necessary and sufficient conditions in 
the case of pure states. By a surprising example, we show that such 
conditions are not sufficient for separability in the case of mixed states, 
suggesting entanglement of a strange type. 



1 Introduction 

Entanglement between two particles A and B provides correlations that have 
no classical counterpart A two-particle pure state can either be a tensor 

product of the one-particle states, or else it contains entanglement between them. 
For mixed states, there is a third possibility, which is a mixture of tensor-product 
states. Such a state is not entangled, and the name separable state is used to 
consider both tensor products and mixtures of tensor products. Fulfilling Bell’s 
inequalities provides necessary conditions for separability. (Equivalently, break- 
ing Bell’s inequalities provides sufficient conditions for entanglement.) There are 
many others necessary conditions for separability, such as the inability for the 
state to be used to teleport a qubit 0 . The other direction is much more subtle; 
necessary conditions for 2-particle entanglement (that is, sufficient conditions 
for separability) were given in some special cases |1()| , but are still missing in the 
general two-particle case j7|. 

For three and more particles, the situation is more complex as we demonstrate 
in this paper. Entanglement between n particles is a generalization of the entan- 
glement between two particles, and a state is considered entangled if it cannot 
be written as a mixture of tensor products of the n particles’ states. We discuss 
properties which, by definition, are non-trivial only for systems composed of 

* Supported in part by Canada’s nserc, Quebec’s fcar, and the Canada Council. 

** Supported in part by Canada’s NSERC and Quebec’s FCAR. 
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more than two particles: two-subsystem entanglement or separability between 
the various possible subsystems. (There are six possibilities in the three-particle 
case.) 

We define k-subsystem semi-separability of a system as follows: Let a system 
contain n particles such that n > k. Divide all particles into k ov k -\- 1 non-empty 
groups of particles (fc or fc -I- 1 subsystems). If we divided into fc -I- 1 subsystems, 
trace out one of them to be left with k subsystems in any case. The original 
system is semi-separable if those k subsystems are separable. 

We shall be interested in two more specific cases, depending on whether we 
trace out one subsystem or not. 

fe-subsystem total semi-separability Divide all particles into k non-empty 
groups {k subsystems) and check separability of the k subsystems, 
fc-subsystem partial semi-separability Divide all particles into fc -I- 1 non- 
empty groups (fc -f 1 subsystems), trace out one subsystem, and check sep- 
arability of the remaining k subsystems. 

We shall refer to these as k-TSS and k-PSS, respectively. The case k = 2 
is of special interest since separability of two particles (two subsystems in our 
case) has been extensively studied. Note that these notions (when k > 2) can 
be reduced to two subsystems semi-separability. We concentrate on fc = 2 in the 
following and drop the index k unless there is a danger of confusion. 

These properties provide a new insight into many-particle entanglement and 
allow us to find a surprising type of entanglement, which shows a new funda- 
mental difference between the properties of pure states and of mixed states. 

We use these properties to provide a partial classification of rr-particle entan- 
glement and separability in terms of the much simpler (albeit still only partially 
solved) problem of separability of two subsystems, each possibly composed of 
several of the original particles. We provide necessary conditions for n-particle 
separability. We also provide necessary and sufficient conditions in the case of 
pure states. These properties may also be useful in the future in providing a 
complete classification of separability versus entanglement of many particles. 
Many-particle entanglement was recently discussed in other works PJHE]. 

2 Conditions for Multi-particle Separability 

Most of the discussion in this section is restricted (for simplicity) to two- 
subsystem entanglements in a system composed of three particles, but is true 
for two subsystems in a system composed of n particles unless explicitly stated 
otherwise. [Possibly, the discussion is also true for k > 2.] Furthermore, we 
restrict our examples to qubits, to make it simple and clear, but all the Facts 
we provide apply to higher-dimensional Hilbert spaces than qubits. 

Let Alice, Bob and Carol (who are spatially separated) have three qubits 
(denoted by A, B and C respectively), prepared in some joint pure state 
where (r) is used to index one of possibly many such states: 
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|^(r)) ^ = aooo|0A0_B0c) + aooi|0A0slc)H , (1) 

2^0 

with |aip = 1. The states |0) and |1) are basis vectors of each qubit, and 
|00) = |0) ® |0). We usually avoid using the tensor product notation 0 unless we 
want to emphasize it. The most general three-qubit state is a mixture of states 
of this type 

( 2 ) 

r 

with 'Bj.Pr = 1- Any such state can be written in various forms. A three-particle 
state is considered separable, or non-entangled (NE), if and only if it can be 
written as a mixture of tensor products 

PNE = ^Ps [p^A ® Pb^ ® Pc V (3) 

S 

A separable pure state can be presented using its Schmidt decomposition |^, 
provided we separate one subsystem at a time (and do it recursively if there are 
more particles). Then it is necessarily in a tensor product of one-particle states 
I'I'ne) = I'I'ab) \4>c) = \(I>a) (S> \4‘b) <8 l^c) where the |^)’s are one qubit states. 

Let us study a three-particle entangled state by looking at the entanglement 
between its various two subsystems. There are three options for partial semi- 
separability: tracing out (ignoring) one particle to be left with two particles. 
Similarly, there are three options for total semi-separability: considering two 
particles as one subsystem, to be left with this combined subsystem and the 
remaining particle. If we trace out particle C and the remaining state of systems 
A and B is separable, we denote this 2-subsystem partial semi-separability by 
PSSc{A; B). If the subsystem composed of AB is separable from C, we denote 
this two-subsystem total semi-separability by TSS{AB; C). Similar notation can 
be written for the other four options obtained by cyclic permutations of the three 
particles. A negation is denoted by TSS{AB',C) saying that AB is entangled 
with C. In the general case of n particles and k subsystems, more terms could 
appear, for instance 4-TSS{A; BC] D] E) and 3-PSScd{A] B; E). Once these 
new notions are established, we can use them to prove some simple facts, relating 
n-particle entanglement to the better investigated entanglement between two 
systems. 

A separable state [such as in Equation (EJ] presents all three possible TSS 
properties. This is immediately obtained from the fact that the state is still 
separable when collecting two particles together: for instance, collecting A and 
B together yields pne = T,sPs[Pab ® Pc^]- Thus, 

Fact 1 If a state does not present all cases of TSS, then it is entangled. 

In other words, the existence of all possible TSS is a necessary condition for 
separability (and the existence of one TSS is a sufficient condition for entangle- 
ment). If one pair of subsystems is entangled (even if other pairs are not) the 
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entire system is entangled. If none of the possible pairs is entangled the system 
might be separable. If each pair of subsystems satisfies a necessary condition for 
separability (for instance, it cannot be purified to a state which violates Bell’s 
inequalities) but is not known to be separable, then the entire system satisfies a 
necessary condition for separability. 

We conjecture that the existence of all possible TSS is also a sufficient condi- 
tion for separability, but this important question is still left open in the general 
case of mixed states. 

A separable state [such as in Equation (0] satisfies 

T'i'CPne = ^PsTi'c[pi"^ p^B ® p'P] = '^PsIp^a ® p'^b] (4) 

s s 

due to the linearity of the trace-out operation. Similar equations can be written 
if A or B are traced out. Therefore it presents all PSS properties. Thus, 

Fact 2 If a state does not present all cases of PSS, then it is entangled. 

In other words, the existence of all possible PSS is a necessary condition for 
separability (and the existence of one PSS is a sufficient condition for entan- 
glement). The converse is not true and a three-particle entangled state might 
present all PSS properties: the GHZ-Mermin state |S| is a good example — see 
SectionEl Therefore, the existence of all PSS is a necessary condition for sepa- 
rability, but not a sufficient one. 

Note that a similar argument implies that if a state fulfills TSS{A; BC) 
then automatically it also fulfills PSSc{A; B) and PSSb{A;C), and similar 
conclusions can be obtained by cyclic permutations. 

For pure states, we now obtain conditions that are both necessary and suf- 
ficient for separability. (We need only prove that they are sufficient, since the 
fact that they are necessary is already shown using the previous facts.) We omit 
Dirac’s braket notation unless confusion could arise. 

Fact 3 A pure state that presents all TSS properties (for all possible decom- 
positions of two subsystems) is separable. 

Proof. We prove it in the case of three particles and leave the general case for 
the final paper. If the three-particle pure state is totally semi-separable for all 
possible ways of decomposing the subsystems then it can be written as Tabc = 
Tab ®Tc- If Tab can be written as Ta ® Tb then Tabc = Ta®Tb ®Tc and 
the state is separable. 

Since Tabc presents all TSS properties it can also be written as Tabc = 
Ta®Tbc- Assuming for a contradiction that Tab cannot be written as Ta®Tb, 
it can be decomposed (due to Schmidt decomposition 0) as aTA®TB+fiT'j^®T'B 
with a 7 ^ 0 and /3 0, where y' denotes an orthogonal state to y for any 

state y. Now Tabc = cPTa ® {Tb ®Tc) + /3T'^ ® {Tg ® Tc) is (by treating B and 
C together) the Schmidt decomposition of A and BC, showing entanglement 
between A and BC, in contradiction to Tabc = T a® Tbc- Q 
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Fact 3 can be strengthened. In the case of three particles, we have: 

Fact 3' A three-particle pure state that is TSS twice is separable. 

Proof. Without loss of generality, suppose that the two TSS properties are 
TSS{AB; C) and TSS{A; BC). We can make use of the proof of Fact 3 since it 
used only these two semi-separabilities anyhow. □ 

Fact 4 A pure state that presents a particular TSS in which one particle is 
separable from all the others, and also presents all PSS, is separable. 

Proof. Again, we prove it in the case of three particles and leave the general case 
for the final paper. Since the three-particle pure state is totally semi-separable 
once [say, TSS{AB; C)] it can be written as Tabc = 'I^ab <8 Tq. We now trace 
out particle C to get Tab = Ta®Tb since the state presents all PSS. Thus, 
Tabc = ® Tb ® Tc- □ 

We conclude that Facts 3, 3' and 4 provide necessary and sufficient conditions 
for the separability of pure states. 

Surprisingly, Fact 4 is not true for mixed states. We present in Section 21 a 
three-particle mixed state that presents one case of TSS [say, TSS{A; BC)] and 
all PSS, yet it is entangled. 

3 A Few Simple Examples 

Let us first provide some examples of pure states in order to obtain a better 
intuition about the meaning of the different semi-separability conditions. We also 
describe the techniques required for verifying the existence of semi-separabilities. 
Although such tools are not used here, they might be required for a more com- 
plete analysis of the possible connections between semi-separability of mixed 
states. 

Our first example is 

\TEn) = \Ta) O \Tbc) (5) 

with \T~) = (l/-\/2)[|01) — |10)] is the singlet state of two qubits. When the 
non-entangled particle is traced out, the other two are still entangled so we 
have PSSa{B',C), but when B or C are traced out the remaining particles 
are not entangled, so we have PSSb{A; C) and PSSc{A; B). Clearly it is also 
TSS{A; BC), TSS{AB;C) and TSS{AC; B): When the two particles A and 
B are considered as one subsystem AB, it is fully entangled with subsystem 
C since the state can be written as \TaC) O |1) — \Ta^) ® |0), and the states 
YPaC) and \TAi) play the role of the relevant two basis vectors |0) and |1) of the 
four-dimensional subsystem. 

Consider a three-qubit case. If we write the three-particle general pure state 
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tracing out subsystem A shall leave the other two subsystems in a (possibly 
mixed) state 

\k=0 / \ 1=0 ) \k=0 ) \i=0 

3 3 

= '^'^{aokaoi + aikah)\k){l\. (7) 
1^0 

Using the transposition technique recently discovered by Peres cni, it is easy to 
check whether or not this reduced density matrix is separable for any particular 
case. This result (and similar results for tracing out the other particles) can be 
used to verify if entanglement between two particles exists. For larger systems 
(such as qutrits, which are 3-dimensional Hilbert spaces), no perfect way to tell 
if the two remaining qutrits are separable have been found so far, and it is known 
that Peres’s criterion does not apply j7]. 

To verify that all cases of PSS are present, the three possibilities must be 
checked. For instance, the state 

, cos0,„„„, cos0,„_, sin0_„„, sin0,_ , 

1'1'abc) = ^ |000) + ^ |011) + ^ 

(with |000) = |0 a0b0c)) leads to the reduced density matrix 
PBC = \ 

which is entangled for 9 ^ -k 

Checking the TSS property is sometimes impossible even for 3 qubits, using 
the best techniques currently available. If we write the three-particle general 
pure state of Equation @ using a Schmidt decomposition, it can be written as 

\P) = cos6»|(/)a)|xbc) + sin6»e*‘^|(/)'^)|x'Bc) (10) 

(where |^') stands for a state orthogonal to li/')). Although it is not obvious to 
find this decomposition, this can always be done P|. When both terms appear 
in the Schmidt decomposition, the state is TSS{A; BC). Generalizing this argu- 
ment to mixed states is usually impossible with the current techniques. Peres’s 
transposition uni works unconditionally 0 only for two qubits (2x2) and for 
a qubit and a qutrit (2 x 3), and it can only be used to check entanglement 
of a four-dimensional system with a two-dimensional system, but not to check 
separability. 

For the mixed state that is of interest to us, it is possible to find the appro- 
priate Schmidt decomposition as we shall see in Sectional 



/ 1 0 0cos26»^^ 

0 0 0 0 
0 0 0 0 
\cos26i0 0 1 j 



(9) 
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In the following examples of pure states, the state is already given in such a 
Schmidt decomposition (for all possible combinations of pairs), so they clearly 
present no TSS. For a GHZ-Mermin state jOl 

^GHZM = (1/'s/2)[|0^0b0c) + |1a1_b1c)] i (11) 

let us consider the particles of Bob and Carol as one four-dimensional subsystem. 
Then this state becomes a Bell state 

(1/V2)[|0a(00)bc) + U(11)bc)] (12) 

so that 1 00) and |11) play the role of |0) and |1), the relevant two basis vectors 
of the four-dimensional subsystem. Clearly, the same is true for the other two 
cases. This state presents all PSS: when one particle is traced out, the other 
two are in a mixture-of-product state. Similar arguments apply to the following 
state: 

^Zei = [1/V2)[\0 a'I'^c) + > (13) 

with |>F+) = (l/-\/2)[|01) -I- |10)]. Particle A is entangled to the subsystem com- 
posed of BC together. By transforming to the |0 ± 1) basis of particle A, this 
state becomes a CHZ-Mermin state, so it presents all properties as above. 

As we previously said, three-particle entangled pure states can at most 
present one TSS. We conjecture that this is also true for a mixed entangled 
state, but we leave this question opeifl for future research. 

4 A Surprising Example 

We now present our surprising example: a three-particle mixed state that 
presents one TSS and all PSS, but yet is entangled! [This should be contrasted 
with Fact 4.] 

Consider the state that is composed of an equal mixture of the two states 
\Ti) = |0a) C \Tbc) and [<^ 2 ) = |1a) C and thus 

p = i [(|0^)(0^|) C + (|U)(U|) C i\^Bc){^Bc\)] • (14) 

This state presents all PSS: when particle A is traced out, the other particles are 
left in an equal mixture of the two Bell states |!F+) and |>F“), which is separable 
since this is the same as an equal mixture of jOl) and jlO); when particle B 
(resp. C) is traced out, it is clear that the particle entangled to it, C (resp. B), 
does not become entangled with A. This state also presents TSS{A; BC): when 
subsystems B and C are considered as one subsystem, A is clearly separable 
from it since they are written as mixture of products. 

To prove that it is an entangled state, let us show that it is not semi- 
separable for TSS{AB; C). Then, Fact 1 implies that the three-particle state 

^ The analysis is much more complicated since semi-separability for mixed state does 
not mean tensor product but mixture-of-tensor-products. 
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is entangled. The two pure states that are mixed to give p can be written as 
I ' i ' i ) = |0a0b) G) |1c) + |0a1b) G) |0c) and {^ 2 ) = |1a0b) |1c) — \^a^b) ® |0c), 

up to normalization. Now we can see that C is entangled to different two- 
dimensional subspaces of AB in each of these pure states, thus mixing cannot 
reduce or cancel this entanglement. 

Such a state is surprising: although Alice is not entangled with the subsystem 
of Bob and Carol (together) she can control their entanglement. If she measures 
in the |0 ± 1) basis, Bob and Carol will not be entangled whatsoever, but if 
she measures in the computational basis. Bob and Carol are entangled with- 
out knowing it, and their state depends on Alice’s measurement result. Thus, 
Bob and Carol are in a separable state if they ignore (trace out) Alice’s knowl- 
edge, but they become entangled once they receive Alice’s result using classical 
communication . 



5 Generalizations and Conclnsions 

To summarize, we analysed three-particle entanglement/separability (and be- 
yond) in terms of its possible two-subsystems entangle me nts/separability, which 
we call total and partial semi-separability. We presented necessary conditions 
for separability and also sufficient conditions in the case of pure states. We also 
discussed possible generalizations and presented a surprising state. 

Generalizations to larger systems are more complicated. A four-qubit 
system A, B, C and D, can be discussed in terms of two-particle/three- 
particle entanglement, and semi-separability to 2-TSS{A; BCD) etc., and 
2-TSS{AB; CD) etc., and 5-TSS{A; B; CD) etc., and also 5-PSSa{B; C; D) 
etc., and 2-PSSab{C; D) etc., where all the “etc.” refer to permutations of 
the particles. A relatively simple (and yet interesting) example can be built 
by replacing |0 a) and |1 a) of the previous example by two Bell states of two 
particles {A and D)\ 

p = 1 [unDmDi) ® iincunci) + unDunDi) ® H'l'scmd) ] ad 

so that Bob and Carol are entangled if Alice and David (D) measure their 
entanglement, but if Alice and David measure in a product basis. Bob and Carol 
become disentangled. 
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Abstract. The quantum Fourier transform (QFT), a quantum analog 
of the classical Fourier transform, has been shown to be a powerful tool 
in developing quantum algorithms. However, in classical computing there 
is another class of unitary transforms, the wavelet transforms, which are 
every bit as useful as the Fourier transform. Wavelet transforms are used 
to expose the multi-scale structure of a signal and are likely to be use- 
ful for quantum image processing and quantum data compression. In 
this paper, we derive efficient, complete, quantum circuits for two rep- 
resentative quantum wavelet transforms, the quantum Haar and quan- 
tum Daubechies transforms. Our approach is to factor the classi- 
cal operators for these transforms into direct sums, direct products and 
dot products of unitary matrices. In so doing, we find that permuta- 
tion matrices, a particular class of unitary matrices, play a pivotal role. 
Surprisingly, we find that operations that are easy and inexpensive to 
implement classically are not always easy and inexpensive to implement 
quantum mechanically, and vice versa. In particular, the computational 
cost of performing certain permutation matrices is ignored classically 
because they can be avoided explicitly. However, quantum mechanically, 
these permutation operations must be performed explicitly and hence 
their cost enters into the full complexity measure of the quantum trans- 
form. We consider the particular set of permutation matrices arising in 
quantum wavelet transforms and develop efficient quantum circuits that 
implement them. This allows us to design efficient, complete quantum 
circuits for the quantum wavelet transform. 



Key Words: Quantum Computing, Quantum Algorithms and Circuits, Wavelet 
Transforms 

1 Introduction 

The field of quantum computing has undergone an explosion of activity over the 
past few years. Several important quantum algorithms are now known. Moreover, 
prototypical quantum computers have already been built using nuclear magnetic 
resonance [1, 2] and nonlinear optics technologies [3]. Such devices are far from 
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being general-purpose computers. Nevertheless, they constitute significant mile- 
stones along the road to practical quantum computing. 

A quantum computer is a physical device whose natural evolution over time 
can be interpreted as the execution of a useful computation. The basic element 
of a quantum computer is the quantum bit or ’’qubit”, implemented physically 
as the state of some convenient 2-state quantum system such as the spin of an 
electron. Whereas a classical bit must be either a 0 or a 1 at any instant, a qubit 
is allowed to be an arbitrary superposition of a 0 and a 1 simultaneously. To 
make a quantum memory register we simply consider the simultaneous state of 
(possibly entangled) tuples of qubits. 

The state of a quantum memory register, or any other isolated quantum 
system, evolves in time according to some unitary operator. Hence, if the evolved 
state of a quantum memory register is interpreted as having implemented some 
computation, that computation must be describable as a unitary operator. If the 
quantum memory register consists of n qubits, this operator will be represented, 
mathematically, as some 2" x 2" dimensional unitary matrix. 

Several quantum algorithms are now known, the most famous examples be- 
ing Deutsch and Jozsa’s algorithm for deciding whether a function is even or 
balanced [4], Shor’s algorithm for factoring a composite integer [5] and Grover’s 
algorithm for finding an item in an unstructured database [6]. However, the 
field is growing rapidly and new quantum algorithms are being discovered every 
year. Some recent examples include Brassard, Hoyer, and Tapp’s quantum algo- 
rithm for counting the number of solutions to a problem [7], Cerf, Grover and 
Williams quantum algorithm for solving NP-complete problems by nesting one 
quantum search within another [8] and van Dam, Hoyer, and Tapp’s algorithm 
for distributed quantum computing [9]. 

The fact that quantum algorithms are describable in terms of unitary trans- 
formations is both good news and bad for quantum computing. The good news is 
that knowing that a quantum computer must perform a unitary transformation 
allows theorems to be proved about the tasks that quantum computers can and 
cannot do. For example, Zalka has proved that Grover’s algorithm is optimal 
[10]. Aharonov, Kitaev, and Nisan have proved that a quantum algorithm that 
involves intermediate measurements is no more powerful than one that post- 
pones all measurements until the end of the unitary evolution stage [11]. Both 
these proofs rely upon quantum algorithms being unitary transformations. On 
the other hand, the bad news is that many computations that we would like to 
perform are not originally described in terms of unitary operators. For example, 
a desired computation might be nonlinear, irreversible or both nonlinear and 
irreversible. As a unitary transformation must be linear and reversible we might 
need to be quite creative in encoding a desired computation on a quantum com- 
puter. Irreversibility can be handled by incorporating extra ’’ancilla” qubits that 
permit us to remember the input corresponding to each output. But nonlinear 
transformations are still problematic. 

Fortunately, there is an important class of computations, the unitary trans- 
forms, such as the Fourier transform, Walsh-Hadamard transform and assorted 
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wavelet transforms, that are describable, naturally, in terms of unitary opera- 
tors. Of these, the Fourier and Walsh-Hadamard transforms have been the ones 
studied most extensively by the quantum computing community. In fact, the 
quantum Fourier transform (QFT) is now recognized as being pivotal in many 
known quantum algorithms [12]. The quantum Walsh-Hadamard transform is a 
critical component of both Shor’s algorithm [5] and Grover’s algorithm [6]. How- 
ever, the wavelet transforms are every bit as useful as the Fourier transform, at 
least in the context of classical computing. For example, wavelet transforms 
are particularly suited to exposing the multi-scale structure of a signal. They 
are likely to be useful for quantum image processing and quantum data com- 
pression. It is natural therefore to consider how to achieve a quantum wavelet 
transform. 

Starting with the unitary operator for the wavelet transform, the next step 
in the process of finding a quantum circuit that implements it, is to factor the 
wavelet operator into the direct sum, direct product and dot product of smaller 
unitary operators. These operators correspond to 1-qubit and 2-qubit quantum 
gates. For such a circuit to be physically realizable, the number of gates within 
it must be bounded above by a polynomial in the number of qubits, n. Finding 
such a factorization can be extremely challenging. For example, although there 
are known algebraic techniques for factoring an arbitrary 2" x 2" operator, e.g. 

[13] , they are guaranteed to produce 0(2"), i.e., exponentially many, terms in the 
factorization. Hence, although such a factorization is mathematically valid, it is 
physically unrealizable because, when treated as a quantum circuit design, would 
require too many quantum gates. Indeed, Knill has proved that an arbitrary 
unitary matrix will require exponentially many quantum gates if we restrict 
ourselves to using only gates that correspond to all 1-qubit rotations and XOR 

[14] . It is therefore clear that the key enabling factor for achieving an efficient 
quantum implementation, i.e., with a polynomial time and space complexity, is 
to exploit the specific structure of the given unitary operator. 

Perhaps the most striking example of the potential for achieving compact 
and efficient quantum circuits is the case of the Walsh-Hadamard transform. In 
quantum computing, this transform arises whenever a quantum register is loaded 
with all integers in the range 0 to 2" — 1. Classically, application of the Walsh- 
Hadamard transform on a vector of length 2" involves a complexity of 0(2"). 
Yet, by exploiting the factorization of the Walsh-Hadamard operator in terms 
of the Kroenecker product, it can implemented with a complexity of 0(1) by n 
identical 1-qubit quantum gates. Likewise, the classical FFT algorithm has been 
found to be implementable in a polynomial space and time complexity, quantum 
circuit [15] (see also Sec. 2.3). However, exploitation of the operator structure 
arising in the wavelet transforms (and perhaps other unitary transforms) is more 
challenging. 

A key technique, in classical computing, for exposing and exploiting specific 
structure of a given unitary transform is the use of permutation matrices. In 
fact, there is an extensive literature in classical computing on the use of permu- 
tation matrices for factorizing unitary transforms into simpler forms that enable 
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efficient implementations to be devised (see, for example, [16] and [17]). How- 
ever, the underlying assumption in using the permutation matrices in classical 
computation is that they can be implemented easily and inexpensively. Indeed, 
they are considered so trivial that the cost of their implementation is often not 
included in the complexity analysis. This is because any permutation matrix can 
be described by its effect on the ordering of the elements of a vector. Hence, it 
can simply be implemented by re-ordering the elements of the vector involving 
only data movement and without performing any arithmetic operations. As is 
shown in this paper, the permutation matrices also play a pivotal role in the fac- 
torization of the unitary operators that arise in the wavelet transforms. However, 
unlike the classical computing, the cost of implementation of the permutation 
matrices cannot be neglected in quantum computing. Indeed, the main issue in 
deriving feasible and efficient quantum circuits for the quantum wavelet trans- 
forms considered in this paper, is the design of efficient quantum circuits for 
certain permutation matrices. Note that, any permutation matrix acting on n 
qubits can mathematically be represented by a 2" x 2" unitary operator. Hence, 
it is possible to factor any permutation matrix by using general techniques such 
as [13] but this would lead to an exponential time and space complexity. How- 
ever, the permutation matrices, due to their specific structure (i.e., sparsity 
pattern), represents a very special subclass of unitary matrices. Therefore, the 
key to achieve an efficient quantum implementation of permutation matrices is 
the exploitation of this specific structure. 

In this paper, we first develop efficient quantum circuits for a set of permuta- 
tion matrices arising in the development of the quantum wavelet transforms (and 
the quantum Fourier transform). We propose three techniques for an efficient 
quantum implementation of permutation matrices, depending on the permuta- 
tion matrix considered. In the first technique, we show that a certain class of 
permutation matrices, designated as qubit permutation matrices, can directly be 
described by their effect on the ordering of qubits. This quantum description is 
very similar to classical description of the permutation matrices. We show that 
the Perfect Shuffle permutation matrix, designated as 7T2n , and the Bit Reversal 
permutation matrix, designated as P 2 " , which arise in the quantum wavelet and 
Fourier transforms (as well as in many other classical computations) belong to 
this class. We present a new gate, designated as the qubit swap gate or 7 T 4 , which 
can be used to directly derive efficient quantum circuits for implementation of 
the qubit permutation matrices. Interestingly, such circuits for quantum imple- 
mentation of 772" and P 2 " lead to new factorizations of these two permutation 
matrices which were not previously know in classical computation. A second 
technique is based on a quantum arithmetic description of permutation matri- 
ces. In particular, we consider the downshift permutation matrix, designated 
as Q 2 ") which plays a major role in derivation of quantum wavelet transforms 
and also frequently arises in many classical computations [16]. We show that 
a quantum description of Q 2 " can be given as a quantum arithmetic operator. 
This description then allows the quantum implementation of Q 2 " by using the 
quantum arithmetic circuits proposed in [18]. 
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A third technique is based on developing totally new factorizations of the 
permutation matrices. This technique is the most case dependent, challenging, 
and even counterintuitive (from a classical computing point of view). For this 
technique, we again consider the permutation matrix (52" and we show that 
it can be factored in terms of FFT which then allows its implementation by 
using the circuits for QFT. More interestingly, however, we derive a recursive 
factorization of (52" which was not previously known in classical computation. 
This new factorization enables a direct and efficient implementation of (52"- 
Our analysis of though a limited set of permutation matrices reveals some of 
the surprises of quantum computing in contrast to classical computing. That 
is, certain operations that are hard to implement in classical computing are 
much easier to implement on quantum computing and vice versa. As a specific 
example, while the classical implementation of 772" and P2" are much harder (in 
terms of the data movement pattern) than (52" , their quantum implementation 
is much easier and more straightforward than (52"- 

Given a wavelet kernel, its application is usually performed according to the 
packet or pyramid algorithms. Efficient quantum implementation of theses two 
algorithms requires efficient circuits for operators of the form /2>i-t G 772i and 
U 2 i © 72n_2i, for some i, where © and © designate, respectively, the kronecker 
product and the direct sum operator. We show that these operators can be 
efficiently implemented by using our proposed circuits for implementation of 
7721. We then consider two representative wavelet kernels, the Haar [17] and 
Daubechies 77*-^^ [19] wavelets which have previously been considered by Hoyer 
[20]. For the Haar wavelet, we show that Hoyer’s proposed solution is incomplete 
since it does not lead to a gate-level circuit and, consequently, it does not allow 
the analysis of time and space complexity. We propose a scheme for design of a 
complete gate-level circuit for the Haar wavelet and analyze its time and space 
complexity. For the Daubechies wavelet, we develop three new factorizations 
which lead to three gate-level circuits for its implementation. Interestingly, one 
of this factorization allows efficient implementation of Daubechies wavelets 
by using the circuit for QFT. 



2 Efficient Quantum Circuits for Two Fundamental 
Qubits Permutation Matrices: Perfect Shuffie and 
Bit-Reversal 



In this section, we develop quantum circuits for two fundamental permutation 
matrices, the perfect shuffle, 772", and the bit reversal, P2", permutation ma- 
trices, which arise in quantum wavelet and Fourier transforms as well as many 
classical computations involving unitary transforms for signal and image process- 
ing [16]. For quantum computing, these two permutation matrices can directly 
be described in terms of their effect on ordering of qubits. This enables the design 
of efficient circuits for their implementation. Interestingly, these circuits lead to 
the discovery of new factorizations for these two permutation matrices. 
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2.1 Perfect Shuffle Permutation Matrices 



A classical description of 7T2n can be given by describing its effect on a given 
vector. If Z is a 2"-dimensional vector, then the vector V = is obtained 

by splitting Z in half and then shuffling the top and bottom halves of the deck. 
Alternatively, a description of the matrix 772", in terms of its elements TTjj, for 
i and j = 0, 1, • • • , 2” — 1, can be given as 

_ r 1 if j = i/2 and i is even, or if j = (i — l)/2 + 2"“^ and i is odd 
0 otherwise 

( 1 ) 

As first noted by Hoyer [20] , a quantum description of 772" can be given by 

772" : |an-i a„_2 ■ ■ ■ ai oq) ' — > jao On-i a«-2 • • • oi) (2) 

That is, for quantum computation, 772" is the operator which performs the left 
qubit-shift operation on n qubits. Note that, (7 indicates the transpose) 
performs the right qubit-shift operation, i.e.. 



m 



|On-l Cln-2 ' ' ' Oi Oq) 



\ a „-2 



■ai ao a„_i) 



(3) 



2.2 Bit-Reversal Permutation Matrices 

A classical description of P 2 " can be given by describing its effect on a given 
vector. If Z is a 2"-dimensional vector and Y = P2^Z, then Yi = Zj, for 
i = 0,1, •••,2" — 1, wherein j is obtained by reversing the bits in the binary 
representation of index i. Therefore, a description of the matrix P 2 ", in terms of 
its elements , for i and j = 0, 1, • • • , 2” — 1, is given as 



P — 



1 if j is bit reversal of i 



0 



otherwise 



(4) 



A factorization of P 2 " in terms of ^ 2 * is given as [16] 

P 2 " = P2"(72 <8> P2"-i) ■ ■ ■ (^ 2 * Z) Il 2 n-i) ■ ■ ■ (72"-3 0 IIs){l 2<^-2 0 II 4 ) (5) 



A quantum description of P 2 " is given as 

P 2 " : jon-l Cln-2, ’ ’ ' Ol Oq) ' \o-0 ' Cln-2 On-l) 



( 6 ) 



That is, P 2 " is the operator which reverses the order of n qubits. This quantum 
description can be seen from the factorization of P 2 «, given by (5), and quan- 
tum description of permutation matrices P 2 >- It is interesting to note that for 
classical computation the term ’’bit-reversal” refers to reversing the bits in the 
binary representation of index of the elements of a vector while, for quantum 
computation, the matrix P 2 " literally performs a reversal of the order of qubits. 

Note that, P 2 " is symmetric, i.e., P 2 " = P|n [16]. This can be also easily 
proved based on the quantum description of P 2 " since if the qubits are reversed 
twice then the original ordering of the qubits is restored. This implies that, 
P 2 "P 2 " = 72" and since P 2 " is orthogonal, i.e., P 2 "P|n = I 2 ", it then follows 
that P 2 " = P|n. 
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2.3 Quantum FFT and Bit-Reversal Permutation Matrix 

Here, we review the quantum FFT algorithm since it not only arises in derivation 
of the quantum wavelet transforms (see Sec. 4.3) but also it represents a case 
in which the roles of permutation matrices 772" and P 2 " seems to have been 
overlooked in quantum computing literature. 

The classical Cooley-Tukey FFT factorization for a 2"-dimensional vector is 
given by [16] 

7^2" = A„An-l ■ ■ ■ A 1 P 2 " = F_2nP2n (7) 

where Ai = 72"-i 0 ^ 2 * = ^ ^ I and 772*-i is a diagonal 

Vi \^72i-i — 172i-i y 

matrix given as f22>-i = Diag{l, 1 ^ 2 ^ ^ 2 «> •■•:‘-^ 2 * with Cjj 2 i = e 2 * and 

t = -\/M. We have that = The operator 

F_ 2 n = AnAn-l ■ ■ ■ Ai (8) 

represents the computational kernel of Cooley-Tukey FFT while P 2 " represents 
the permutation which needs to be performed on the elements of the input vector 
before feeding that vector into the computational kernel. Note that, the presence 
of P 2 " in (7) is due to the accumulation of its factors, i.e., the terms (72^ C)772n-i), 
as given by (5). 

The Gentleman-Sande FFT factorization is obtained by exploiting the sym- 
metry of F 2 " and transposing the Cooley-Tukey factorization [16] leading to 

P2" = P2"4 ■■■K_,Ai = P2"£2" (9) 



where 



pt — At At At 

£- 2 " — 



(10) 



represents the computational kernel of the Gentleman-Sande FFT while P 2 " 
represents the permutation which needs to be performed to obtain the elements 
of the output vector in the correct order. 

In [15] a quantum circuit for the implementation of £ 2 ", given by (8), is 
presented by developing a factorization of the operators P 2 « as 



„ 1 fl2i-i 172«-i \ 1 fht-t 72i-l \ fl2i-i 0 \ 

Let C 21 = I ^ ^ ^ I • It then follows that 

I 0 172i-i J 



( 11 ) 



P 21 — (1C 0 721- l)C2* 

A-i = 72"— i P 21 “ (72"— 1 C* IF C* 72^—1 ) (72"— i £21 ) 
In [15] a factorization of the operators C 21 is developed as 



(12) 

(13) 



7/21 — ^n—l,n—i^n—2,n—i ' ' 






where 6jk is a two-bit gate acting on jth and fcth qubits. 



(14) 




Quantum Wavelet Transforms: Fast Algorithms and Complete Circuits 



17 



33 

32 

3i 

3o 




Fig. 1. A circuit for implementation of quantum Fourier transform, QFT (from 
[15]). 

Using (13)-(14) a circuit for implementation of (8) is developed in [15] and 
presented in Fig. 1. However, there is an error in the corresponding figure in 
[15] since it implies that, with a correct ordering of the input qubits, the output 
qubits are obtained in a reverse order. Note that, as can be seen from (7), the 
operator ^ 2 ^ performs the FFT operation and provides the output qubits in a 
correct order if the input qubits are presented in a reverse order. 

The quantum circuit for Gentleman-Sande FFT can be obtained from the 
circuit of Fig. 1 by first reversing the order of gates that build the operator block 
Ai (and thus building operators A\) and then reversing the order of the blocks 
representing operators Ai. By using the Gentleman-Sande circuit, with the input 
qubits in the correct order the output qubits are obtained in reverse order. 

For an efficient and correct implementation of the quantum FFT, one needs 
to take into account the ordering of the input and output qubits, particularly if 
the FFT is used as a block box in a quantum computation. If the FFT is used 
as a stand-alone block or as the last stage in the computation (and hence its 
output is sampled directly), then it is more efficient to use the Gentleman-Sande 
FFT since the ordering of the output qubits does not cause any problem. If the 
FFT is used as the first stage of the computation, then it is more efficient to use 
the Gooley-Tukey factorization by preparing the input qubits in a reverse order. 
Note that, as in classical computation, each or a combination of the Gooley- 
Tukey or Gentleman-Sande FFT factorization can be chosen in a given quantum 
computation to avoid explicit implementation of P 2 " (or, any other mechanism) 
for reversing the order of qubits and hence achieve a greater efficiency. As an 
example, in Sec. 4.3 we will show that the use of the Gooley-Tukey rather than 
the Gentleman-Sande factorization leads to a greater efficiency in quantum im- 
plementation by eliminating the need for an explicit implementation of P 2 ” (or, 
any other mechanism) for reversing the order of qubits. 

2.4 A Basic Quantum Gate for Efficient Implementation of Qubits 
Permutation Matrices 

If a permutation matrix can be described by its effect on the ordering of the 
qubits then it might be possible to devise circuits for its implementation directly. 
We call the class of such permutation matrices as ” Qubit Permutation Matrices” . 
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Fig. 2. The gate (a) and its implementation by using three XOR (Controlled- 
NOT) gates (b). 



A set of efficient and practically realizable circuits for implementation of Qubit 
Permutation Matrices can be built by using a new quantum gate, called the qubit 
swap gate, II 4 , where 



774 = 



/I 0 0 0\ 
0 0 10 
0 10 0 
\0 0 0 1 / 



(15) 



For quantum computation, II 4 is the ’’qubit swap operator”, i.e.. 



774 : loioo) I — > laooi) (16) 

The 7 T 4 gate, shown in Fig. 2. a, can be implemented with three XOR (or 
Controlled-NOT) gates as shown in Fig. 2.b. The II4 gate offers two major 
advantages for practical implementation: 

— It performs a local operation, i.e., swapping the two neighboring qubits. This 
locality can be advantageous in practical realizations of quantum circuits, 
and 

— Given the fact that II4 can be implemented using three XOR (or, Controlled- 
NOT) gates, it is possible to implement conditional operators involving II4, 
for example, operators of the form II4 0 l 2 "- 4 , by using Controlled*-NOT 
gates [21]. 

A circuit for implementation of 772" by using II4 gates is shown in Fig. 3. 
This circuit is based on an intuitively simple idea of successive swapping of the 
neighboring qubits, and implements 772" with a complexity of 0{n) by using an 
0 {n) number of II4 gates. It is interesting to note that, this circuit leads to a 
new (to our knowledge) factorization of 772" in terms of II4 as 

772" = (72"-2 0 T74)(72"-3 0 II4 0 I2) • • • (72"-i 0 II4 0 721 - 2 ) X 
• • • {I2 0 774 0 72"-3)(T74 0 72"-2) 



( 17 ) 
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Fig. 3. A circuit for implementation of Perfect Shuffle permutation matrix, 772" . 



This new factorization of 7T2n is less efficient than other schemes (see, for 
example, [16]) for a classical implementation of 772"- Interestingly, it is derived 
here as a result of our search for an efficient quantum implementation of 772n, 
and in this sense it is only efficient for a quantum implementation. Note also, 
that a new (to our knowledge) recursive factorization of 772i directly results from 
Fig. (3) as 






(18) 



A circuit for implementation of P2" by using 774 gates is shown in Fig. 4. 
Again, this circuit is based on an intuitively simple idea, that is, successive 
and parallel swapping of the neighboring qubits, and implements P2” with a 
complexity of 0{n) by using O(n^) II 4 gates. This circuit leads to a new (to our 
knowledge) factorization of P2" in terms of II 4 as 



P2'* = ((774 (g) 774 ■ ■ ■ 0 774^(72 0 774 (g) ■ ■ ■ 0 774 072))^ (19) 



for n even, and 



P2~ = ((72 (g) 774 (g) ■ ■ ■ (g) 774^ ( 774 (g) ■ ■ ■ T74 (8)72))^(72 0 774 (g) ■ ■ ■ (g) 774) (20) 

n-1 n-1 n-1 

2 2 2 



for n odd. 
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Fig. 4. Circuits for implementation of Bit Reversal permutation matrix, P 2 " , for 
n even (a) and for n odd (b). 



It should be emphasized that this new factorization of P 2 " is less efficient 
than other schemes, e.g., the use of (5) for a classical implementation (see also 
[16] for further discussion). However, this factorization is more efficient for a 
quantum implementation of P 2 " • In fact, a quantum implementation of P 2 " by 
using (5) and (17) will result in a complexity of O(n^) by using O(n^) II 4 gates. 

As will be shown, the development of complete and efficient circuits for im- 
plementation of wavelet transforms requires a mechanism for implementation of 
conditional operators of the forms U 2 i © / 2 "- 2 * and ^ 2 * © 72~-2m for some i. 
The key enabling factor for a successful implementation of such conditional op- 
erators is the use of factorizations similar to (17) and (19)-(20) or, alternatively, 
circuits similar to those in Figures 3 and 4, along with the conditional operators 
involving II 4 gates. 
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3 Quantum Wavelet Algorithms 

3.1 Wavelet Pyramidal and Packet Algorithms 

Given a wavelet kernel, its corresponding wavelet transform is usually performed 
according to a packet algorithm (PAA) or a pyramid algorithm (PYA) . The first 
step in devising quantum counterparts of these algorithms is the development of 
suitable factorizations. Consider the Daubechies fourth-order wavelet kernel of 
dimension 2*, denoted as First level factorizations of PAA and PYA for a 
2 "-dimensional vector are given as 

PAA = (l2n-2 0 ^)(/ 2 n -3 (g) TTg) ' ' ' G X 

(/2n-i-i G 7T2i+i) • ■■(l2 0 (21) 

PYA = (ilf 0 72n_4)(7T8 © / 2 »- 8 ) • • • (17^? 0 72»-20 

( 772^+1 0 72 - - 2 ^+ 1 ) • • • 772 - 77 ^ 1 ^ ( 22 ) 

These factorizations allow a first level analysis of the feasibility and efficiency 
of quantum implementations of the packet and pyramid algorithms. To see this, 
suppose we have a practically realizable and efficient, i.e., 0(i), quantum al- 
gorithm for implementation of For the packet algorithm, the operators 

( 72 ~-> 0 7 ) 2 ?) can be directly and efficiently implemented by using the algo- 
rithm for 77 2 ?. Also, using the factorization of 772 *, given by (17), the operators 
( 72 n-i 0 772 *) can be implemented efficiently in 0(i). 

For the pyramid algorithm, the existence of an algorithm for does not 
automatically imply an efficient algorithm for implementation of the conditional 
operators (T?^? 0 72 >»_ 2 *). An example of such a case is discussed in Sec. 4.4. 
Thus, careful analysis is needed to establish both the feasibility and efficiency 
of implementation of the conditional operators (77 2 ^^ © 72 n_ 2 *) by using the 
algorithm for 772 ^^ Note, however, that the conditional operators ( 772 * © 72 n_ 2 *) 
can be efficiently implemented in 0(i) by using the factorization in (17) and the 
conditional II4 gates. 

The above analysis can be extended to any wavelet kernel (WK) and sum- 
marized as follows: 

— Packet algorithm: A physically realizable and efficient algorithm for the WK 
along with the use of (17) leads to a physically realizable and efficient im- 
plementation of the packet algorithm. 

— Pyramid algorithm: A physically realizable and efficient algorithm for the 
WK does not automatically lead to an implementation of the conditional 
operators involving WK (and hence the pyramid algorithm) but the condi- 
tional operators ( 772 * 0 72 n_ 2 *) can be efficiently implemented by using the 
factorization in (17) and the conditional IT4 gates. 
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Fig. 5. A block- level circuit for Haar wavelet (from [20]). 



3.2 Haar Wavelet Factorization and Implementation 

The Haar transform can be defined from the Haar functions [17]. Hoyer [20] 
used a recursive definition of Haar matrices based on the generalized Kronecker 
product (see also [17] for similar definitions) and developed a factorization of 772" 
as 



772" = (/2"-l (72n-i (g) W 0 /2n_2n-i + l) ’ ’ ’ (IF 0 /2"-2) X 

( 7 T 4 0 /2"_4) • • • (7T2i 0 /2"_20 ■ ■ ■ (-^ 2 "-i ® 72n~l)7T2" (23) 

Hoyer’s circuit for implementation of (23) is shown in Fig 5. However, this repre- 
sents an incomplete solution for quantum implementation and subsequent com- 
plexity analysis of the Haar transform. To see this, let 

^2"^ = (-^2"-! 0 IF) • • • (72n-i 0 IF 0 72n_2~-i+i) • • • (IF 0 /2"-2) (24) 

= (774 0 /2"-4) • • • (7T2i 0 72n_20 • • • (772.-1 0 72 „-i)7T2" (25) 

Clearly, the operator can be implemented in 0(n) by using 0(n) conditional 

(2) 

IF gates. But the feasibility of practical implementation of the operator and 
its complexity (and consequently those of the factorization in (23)) cannot be 
assessed unless a mechanism for implementation of the terms (772* 0 72._20 is 
devised. 

However, by using the factorizations and circuits similar to (17) and Figure 
3, it can be easily shown that the operators (77210 72. _ 2 i) can be implemented in 
0{i) by using 0{i) conditional II 4 gates (or, Controlled*-NOT gates). This leads 
to the implementation of and consequently 772. in 0 {n^) by using 0 {n^) 
gates. This represents not only the first practically feasible quantum circuit for 
implementation of 772. but also the first complete analysis of complexity of 
its time and space (gates) quantum implementation. Note that, both operators 
(72.-1 0 772i) and (772i 0 72._2i) can be directly and efficiently implemented by 
using the above algorithm and circuit for implementation of 772i. This implies 
both the feasibility and efficiency of the quantum implementation of the packet 
and pyramid algorithms by using our factorization for Haar wavelet kernel. 
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3.3 Daubechies Wavelet and Hoyer’s Factorization 

The Daubechies fourth-order wavelet kernel of dimension 2" is given in a matrix 
form as [22] 

/ Co Cl C2 C3 ^ 

C3 — C2 Cl —Co 

Co Cl C2 C3 
C3 — C2 Cl —Co 



£)(4) _ 
^2" — 



C2 C3 
\Cl -Co 



Co Cl C2 C3 
C3 — C2 Cl —Co 
Co Cl 
C3 -C2 J 



(26) 



where co = ci = C2 = and C3 = . For classical 

computation and given its sparse structure, the application of can be per- 
formed with an optimal cost of 0(2"). However, the matrix as given by 

(26), is not suitable for a quantum implementation. To achieve a feasible and 
efficient quantum implementation, a suitable factorization of needs to be 
developed. Hoyer [20] proposed a factorization of as 



D 



(4) 



^2" = (-f2'»-i ® C'i)5'2n(/2"-i ® C'o) 

where 

andOi = i 

-C2 C4 y 2 

and S'2" is a permutation matrix with a classical description given by 



Oo = 2 



Cfl. 

C4 

1 ^ 
C2 



S^j = 



1 if i = j and i is even, or if t -|- 2 = j (mod 2") 
0 otherwise 



(27) 



(28) 



(29) 



Hoyer’s block-level circuit for implementation of (27) is shown in Figure 6. 
Clearly, the main issue for a practical quantum implementation and subsequent 
complexity analysis of (27) is the quantum implementation of matrix S'2'*. To 
this end, Hoyer discovered a quantum arithmetic description of S'2" as 



^2 



jOn-l On-2 ' ' ' Oq) ' *■ j^n-l &n-2 ' ' ' &1 &0 



where 



b,= 



Ui — 2 (mod n), if t is odd 
a, otherwise 



(30) 



(31) 



As suggested by Hoyer, this description of 5'2" then allows its quantum imple- 
mentation by using quantum arithmetic circuits of [18] with a complexity of 
0{n). This algorithm can be directly extended for implementation of the op- 
erators (/2~-i and hence the packet algorithm. However, the feasibility 

and efficiency of an implementation of the operators (l2n-i © and thus the 
pyramid algorithm needs further analysis. 
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4 Fast Quantum Algorithms and Circuits for 
Implementation of Daubechies Wavelet 

In this section, we develop a new factorization of the Daubechies wavelet. 
This factorization leads to three new and efficient circuits, including one using 
the circuit for QFT, for implementation of Daubechies wavelet. 



4.1 A New Factorization of Daubechies Wavelet 

We develop a new factorization of the Daubechies wavelet transform by 
showing that the permutation matrix S' 2 ™ can be written as a product of two 
permutation matrices as 

= Q2~i?2" (32) 

where Q 2 ” is the downshift permutation matrix [16] given by 

/0 1 \ 

00 1 
0 0 0 1 

Q2" = ... . (33) 

0 0---0 0 1 

\1 0 ••• 0 0 0 / 

and i? 2 " is a permutation matrix given by 



R2r^ 



0 10 0 0 
1 0 0 0 0 
0 0 0 1 0 
0 0 10 0 




(34) 
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3o 





Fig. 7. A block-level circuit for implementation of new factorization of . 



The matrix can be written as 



R^n = l2n-l (g) N (35) 

where N = q^- Substituting (35) and (32) into (27), a new factorization of 

is derived as 

= (/2n-l 0 Ci)Q2-(/2"-i ® Af)(/2n-l 0 Co) = 0 Ci)g2»(/2-i C') 

(36) 

where 

c; = jv.c„ = 2(;';^»;) (37) 

Fig. 7 shows a block-level implementation of (36). Clearly, the main issue for a 
practical quantum gate-level implementation and subsequent complexity analy- 
sis of (36) is the quantum implementation of matrix ^ 2 "- In the following, we 
present three circuits for quantum implementation of matrix g 2 »- 

4.2 Quantum Arithmetic Implementation of Permutation Matrix 
Q 2 ” 

A first circuit for implementation of matrix ^ 2 " is developed based on its de- 
scription as a quantum arithmetic operator. We have discovered such a quantum 
arithmetic description of ^ 2 " as 

Q 2 " ■ |«n-l an-2 • • ■ ai Oo) I > \bn-l bn-2 ' ' ' &1 &o) (38) 

where 

bi = Ui — 1 (mod n) (39) 

This description of Q 2 " allows its quantum implementation by using quantum 
arithmetic circuit of [18] with a complexity of 0(n). Note, however, that the 
arithmetic description of Q 2 ” is simpler than that of S' 2 " since it does not involve 
conditional quantum arithmetic operations (i.e., the same operation is applied 
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to all qubits). This algorithm for quantum implementation of Q2^ and hence 
can be directly extended for implementation of the operators {12^-i 
and hence the packet algorithm. However, the feasibility and efficiency of an 
implementation of the operators ® D^2i) thus the pyramid algorithm 

needs further analysis. 

4.3 Quantum FFT Factorization of Permutation Matrix <52™ 

A direct and efficient factorization and subsequent circuit for implementation of 
Q2» (and hence Daubechies wavelet) can be derived by using the FFT algo- 
rithm. This factorization is based on the observation that (52" can be described 
in terms of FFT as [16] 

Q2" = F2nT2nF*„ (40) 

where T2« is a diagonal matrix given as T2«- = Diagjl, a;2», 

with a;2" = (* indicates conjugate transpose). As will be seen, it is more 

efficient to use the Cooley- Tukey factorization, given by (7), and write (40) as 

Q2" =F2„F2"T2nP2"Z;„ (41) 

It can be shown that the matrix T2n has a factorization as 

T2n = (G(w|r' )0/2"-i) • • • (/2i-i ®G(wir')0/2i-i) • • • (I2"~1 ®G(w2")) ( 42) 

where G(w2") = Diag{l, wf"} = ( n fc )• This factorization leads to an ef- 

^2" / 

ficient implementation of T2« by using n single qubit G{u>2’') as shown 

in Fig. 8. Together with the circuit for implementation of P2" (Fig. 4) and the 
circuit for implementation of FFT (Fig. 1), they represent a complete gate-level 
implementation of . 

However, a more efficient circuit can be derived by avoiding the explicit 
implementation of P2" by showing that the operator 

P2"T2"P2" = P2"(G(w|ra ) 0 72 "-i) ■ ■ ■ (d2>-i ® G(w|n ) 72»i-t) X 

• • • (/2"-i ® G(o;2"))P2" ( 43) 

can be efficiently implemented by simply reversing the order of gates in Fig. 
8. This is established by the following lemma: 

Lemma 1. 

P 2 n{G{toi:~") 0 hn-i) = (l2"-i 0 G{coi:~"))P 2 n (44) 

P 2 n{l 2 n-j G(w2 " ) ® ^23-1) = {l 2 i-^ ® G(w2 " ) ® /2"“.>)F2" (45) 

P2"(d2"-1 ® G(w2")) = (G(w2 ") ® /2 "~i)P2 " (46) 
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a 



n-2 



a 



n-1 




Fig. 8. A circuit for implementation of operator T 2 n. 



Proof. This lemma can be easily proved based on the physical interpretation of 
operations in (44)-(46). The left-hand side of (44) implies first an operation, i.e., 
application of G(w|n ), on the last qubit and then application of P 2 " on all 
the qubits, i.e., reversing the order of qubits. However, this is equivalent to first 
reversing the order of qubits, i.e., applying P 2 ", and then applying G(o;fn ), on 
the first qubit which is the operation described by the right-hand side of (44). 
Similarly, the left-hand side of (45) implies first application of G(w|n ^) on the 
(n — i)th qubit and then reversing the order of qubits. This is equivalent to first 
reversing the order of qubits and then applying G(w|n ) on the ith qubit which 
is the operations described by the right hand side of (45). In a same fashion, the 
left hand side of (46) implies first application of G{uj 2 ^) on the first qubit and 
then reversing the order of qubits which is equivalent to first reversing the order 
of qubits and then applying G(w|n ) on the last qubit, that is, the operations 
in right-hand side of (46). 

Applying (44)-(46) to (43) from left to right and noting that, due to the 
symmetry of P 2 " , we have P 2 " P 2 " = ^ 2 " , it then follows that 

P2"72"P2" = (d2>»-i <E)G((jJ2n )) ■ ■ ■ {l2'n—i(?)G{uJ2n )(8>/2*-i ) ' ' ' (G(w2'*)®.^2"-i ) 

(47) 

The circuit for implementation of (47) is shown in Fig. 9 which, as can be seen, 
has been obtained by reversing the order of gates in Fig. 8. Note that, the use 
of (47), which is a direct consequence of using the Cooley-Tukey factorization, 
enables the implementation of (40) without explicit implementation of P 2 "- 

Using (40) and (47), the complexity of the implementation of Q 2 ^ and thus 
is the same as of the quantum FFT, that is, 0{n^) for an exact implementa- 
tion and 0{nm) for an approximation of order m [15]. Note that, by using (47), 
(40), and (36) both operators (/ 2 "-> ®D^ 2 ^) and (P 2 ? can be directly 

implemented. This implies both the feasibility and efficiency of the quantum 
implementation of the packet and pyramid algorithms by using this algorithm 
for quantum implementation of 
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Fig. 9. A circuit for implementation of operator P2"72nP2" 



4.4 A Direct Recursive Factorization of Permutation Matrix <52™ 

A new direct and recursive factorization of Q 2 ” can be derived based on a simi- 
larity transformation of ( 52 " by using 772" as 

= (oI/V) 

which can be written as 

ni.Q,.n„ = ) = (JV8 e 

(49) 

from which ( 52 " can be calculated as 

Q2" =i72"(A^«)/2"-0(02"-i ©^2"-0^2" (50) 

Replacing a similar factorization of (52"-i into (50), we get 
(52" = 772" (A®72>i-1 )(7T2»i- 1 (A®72n-2 )((52"-2 ©72>i-2)7T2n-l ©72 >i- 1 )7T2n (51) 
By using the identity 

772"-iA772"-i 0 72"-i = (72 0 772"-i ) (A 0 72"-i ) (72 0 772"-i ) (52) 

for any matrix \ (51) can be then written as 

(52" = 772" (A^ 0 1"2"-i)(^2 0 772"-i)((A 0 72"-2) X 

((52"-2 0 72"-2) 0 72"-i)(72 0 772„_i)77|n (53) 

Using the identity 

(A 0 72"-2)((52"-2 0 72"-2) 0 72"-i = (A 0 72"-2 0 72"-i) 

((52"-2 0 72"-2 0 72"-i) 

= (A 0 72"-2 0 72"-i) 

((52"-2 0 73,2"-2) 



(54) 
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(53) is now written as 

Q2n =7T2n(iV®72n-i)(/2 0 7T2"-i)(A^®/2»-2©/2"-i) X 

(Q2n-2 © /2n-2»-2)(/2 © (55) 

Repeating the same procedures for all Q 2 P for t = n — 3 to 1, and noting that 
Q 2 = N , it then follows 

Q 2 " = 7T2n(-/V © /2n-l)(/2 © 7^2"-! )(-^ © © -?2"'-i) ■ ■ ' (-^2”-2 © -^^ 4 ) X 

(Af © 72 © h^-i){N © l2n-2){h^-2 © i7|) • • • (72 © 7r‘„_i)7r‘„ (56) 

The above expression of Q 2 " can be further simplified by exploiting the fact 
that (see Appendix for the proof) every operator of the form (72* © Il 2 n-i), for 
t = n — 2 to 1, commutes with all operators of the form {N®l 2 r>,-j ©72r._2»-3+i), 
for j = i to 1. Using this commutative property, (56) can be now written as 

Q2n = 772"(d2 © 7T2 "-i)('^ 4 © 7T2"-2) • • • (d2«-2 © © 72>i-l) X 

• • • (Af © 72 © © 72n_2)(72n-2 © 77*) • • • (72 © 77*„_077*„ (57) 

Using the factorization of P 2 " given in (5), we then have 

Q2n = P2"(iV©72n-i)(Af©72»-2©/2»-i) • • • (Ai©72©72n-4)(Af©72n-2)P2» (58) 

Substituting (58) into (36), a factorization of is then obtained as 

A>2» = (^2"-i © C'i)P2 »(A* © 72n-l)(A^ © 72n-2 © 72n-l) • • • (Af © 72 © 72n_4) X 

{N © 72n_2)P2n(72n-l © C') (59) 

Using Lemma 1, it then follows that 

= P2n(C'l © 72n-l)(A^ © 72n-l)(A^ ©72n-2 ©72n-l) •• • (Af © 72 © 72»-4) X 
(Af©72n_2)(C(,©72n-l)P2n (60) 

A circuit for implementation of , based on (60), is shown in Fig. 10. Together 
with the circuit for implementation of P 2 ", shown in Fig. 4, they represent a com- 
plete gate-level circuit for implementation of with an optimal complexity 
of 0(n). 

Using (60) and (19)-(20), the operators (72~-i © P^^^) can be directly and 
efficiently implemented with a complexity of 0(i). This implies both the feasi- 
bility and efficiency of the implementation of the packet algorithm by using this 
algorithm for wavelet kernel. However, this algorithm is less efficient for im- 
plementation of the operators (P^? ©72n_2t) and hence the pyramid algorithm. 
To see this, note that, the implementation of the operators (P2?©72n_20> by us- 
ing (60), requires the implementation of the conditional operators (P 2 « ©72n_2t). 
However, these conditional operators cannot be directly implemented by using 
(19) and (20). An alternative solution is to use the factorization of P 2 * in (5) and 
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Fig. 10. A circuit for implementation of by using recursive factorization of 

Q2"- 



the conditional operators (i72* © /2'*-20- However, this leads to a complexity of 
0{i^) for implementation of operators (P 2 ‘ © and hence the operators 

(F* 2 ? ® Therefore, while (60) is optimal for implementation of 

and the packet algorithm, it is not efficient for implementation of the pyramid 
algorithm. 

It should be emphasized that this recursive factorization of (52"? originated 
by the similarity transformation in (48) and given by (56) and (58), was not 
previously known in classical computing. Note that, the permutation matrices 
772" and, particularly, P 2 " are much harder (in terms of data movement pattern) 
for a classical implementation than Q 2 ^- In this sense, such a factorization of 
Q 2 " is rather counterintuitive from a classical computing point of view since 
it involves the use of permutation matrices 772" and P 2 " and thus it is highly 
inefficient for a classical implementation. 



5 Discussion and Conclusion 

In this paper, we developed fast algorithms and efficient circuits for quantum 
wavelet transforms. Assuming an efficient quantum circuit for a given wavelet 
kernel and starting with a high level description of the packet and pyramid al- 
gorithms, we analyzed the feasibility and efficiency of the implementation of the 
packet and pyramid algorithms by using the given wavelet kernel. We also de- 
veloped efficient and complete gate-level circuits for two representative wavelet 
kernels, the Haar and Daubechies kernels. We gave the first complete time 
and space complexity analysis of the quantum Haar wavelet transform. We also 
described three complete circuits for Daubechies wavelet kernel. In partic- 
ular, we showed that Daubechies kernel can be implemented by using the 
circuit for QFT. Given the problem of decoherence, exploitation of parallelism in 
quantum computation is a key issue in practical implementation of a given com- 
putation. To this end, we are currently analyzing the algorithms of this paper in 
terms of their parallel efficiency and developing more efficient parallel quantum 
wavelet algorithms. 

As shown in this paper, permutation matrices play a pivotal role in the devel- 
opment of quantum wavelet transforms. In fact, not only they arise explicitly in 



Quantum Wavelet Transforms: Fast Algorithms and Complete Circuits 



31 



the packet and pyramid algorithms but also they play a key role in factorization 
of wavelet kernels. For classical computing, the implementation of permutation 
matrices is trivial. However, for quantum computing, it represents a challeng- 
ing task and demands new, unconventional, and even counterintuitive (from a 
classical computing view point) techniques. For example, note that most of the 
factorizations developed in paper for permutation matrices 772", 72”, and Q 2 ” 
were not previously known in classical computing and, in fact, they are not at 
all efficient for a classical implementation. Also, implementation of the permuta- 
tion matrices reveals some of the surprises of quantum computing in contrast to 
classical computing. In the sense that, certain operations that are hard to imple- 
ment in classical computing are easier to implement in quantum computing and 
vice versa. As a concrete example, note that while the classical implementation 
of permutation matrices 772” and (particularly) P 2 ” is much harder (in terms 
of data movement pattern) than the permutation matrix ( 52 ", their quantum 
implementation is much easier and more straightforward than ( 52 ” . 

In this paper, we focussed on the set of permutation matrices arising in the 
development of quantum wavelet transforms and analyzed three techniques for 
their quantum implementation. However, it is clear that the permutation matri- 
ces will also play a major role in deriving compact and efficient factorizations, 
i.e., with polynomial time and space complexity, for other unitary operators by 
exposing and exploiting their specific structure. Therefore, we believe strongly 
that a more systematic study of permutation matrices is needed in order to de- 
velop further insight into efficient techniques for their implementation in quan- 
tum circuits. Such a study might eventually lead to the discovery of new and 
more efficient approaches for the implementation of unitary transformations and 
therefore quantum computation. 
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Appendix: Commutation of the Operators 72i O 772n-t with N 0 

72ra_2”-3 + i 

We first prove that every operator of the form 72t ® 772n-t, for 7 = n — 2 to 
1, commutes with all the operators of the form 7V0 72n-3 0 72n_2”-3+i, for j = i 
to 2, by simply showing that 

(72* 0 IJ2n-i){N 0 12^-3 0 72n_2(n_j_|_l) = (TV 0 l2n—j 0 72n_2”-:» + l)(72* 0 Il2n-i) 

( 61 ) 

The matrix / 2 * ‘H) 712^-^ is a block diagonal matrix and therefore can be written 
as 

l2i 0 = I 2 ^ Il2n-j 0 I 23—2 ^ n2f^-3 ( 62 ) 
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It can be then shown that 

(/2<8>I72>i-3 (B l2i —2^ ^2^-3 ){N<S) l2’'-3 ©/2" — + i ) = N <^Il2n.-j (B I 23 —2^ ^2^-3 

( 63 ) 

and 

l2^-^ 0 / 2 ^ — + i ) (-^2 0-^2”- -J 0-^2-? —20-^2”- -J ) “ A^0-/T2”'“J 0-^2-? — 2 0-^2^~.? 

( 64 ) 

It now remains to show that every operator of the form 12* © 112**-* commutes 
with the operator N © l 2 **-* ■ This is simply proved by first using the fact that 

I 2 * © II 2 **—* — I 2 ® (-^ 2 *”^ © IT 2 **—*) (^^) 



and then showing that 

{h © {h*-* © n2**-*)){N © l2**-i) = (iV © l2**-l){l2 © (/2*-i © ^2»-0) 

= iV © l2*-i © i72'*-i (66) 
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Abstract. Presented is a quantum lattice gas for Navier-Stokes fluid 
dynamics simnlation. The quantum lattice-gas transport equation at the 
microscopic scale is presented as a generalization of the classical lattice- 
gas transport equation. A special type of quantum computer network is 
proposed that is suitable for implementing the quantum lattice gas. The 
quantum computer network undergoes a partial collapse of the wave- 
function at every time step of the dynamical evolution. Each quantum 
computer in the network comprises only a few qubits, which are entan- 
gled for only a short time period. A Chapman-Enskog type analysis of 
the quantum computer network indicates that the total system of qubits 
behaves exactly like a viscous lattice-gas fluid at the macroscopic scale. 
Because of the quantum mechanical nature of the scattering process, 
superposition of outgoing collisional possibilities occurs. The quantum 
lattice gas obeys detail balance in its collisions and is therefore an un- 
conditionally stable algorithm for fluid dynamics simulation. 



1 Introduction 

Prior to the advent of digital computing in the late 1940’s, analog computers 
held much promise. An electrical circuit can be constructed to simulate, say, 
an underdamped oscillator governed by a second order differential equation. For 
example, an electrical circuit can drive a trace on an oscilloscope mimicking the 
vertical motion response of a fast moving automobile with poor shock absorbers 
after passing over a speed bump — one continuous physical system configured to 
behave just like another continuous physical system. Today, after five decades 
of digital computing, history may repeat itself in the sense that it may again be 
worthwhile to build “analog” computers — for example, a quantum mechanical 
spin system configured to behave just like a Navier-Stokes fluid. 

The purpose of this paper is to show how to do this. We show how to arrange 
a network of small quantum computers so that, taken as a system, the qubits 
within the network mimick the behavior of a system of massive quantum particles 
moving and colliding on a discrete spacetime lattice. This discrete quantum 
particle system is termed a quantum lattice gas and the associated quantum 
computer network is called a lattice-gas quantum computer. 

Over a decade ago, classical lattice gases were found that behave like a viscous 
Navier-Stokes fluid at the macroscopic scale m- In this paper we show that 

* This work is supported under Task No. 2304CP of the Air Force Office of Scientific 
Research, Mathematical and Computational Sciences Directorate. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 34- ^T71 1999. 
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a quantum lattice gas does too. The prediction of the quantum lattice gas’ 
macroscopic equations of motion is achieved by a generalized Chapman-Enskog 
analysis. A property of the quantum lattice gas is that continuous macroscopic 
fields for the mass density and momentum density are obtained without the 
need for either ensemble averaging or coarse-grain spacetime averaging, both of 
which are computationally expensive in a classical lattice-gas simulation. This 
computationally useful property of the quantum lattice gas arises because it 
models the discrete particle system directly at the mesoscopic scale, avoiding 
noisy fluctuations while retaining detailed balance in the local particle collisions 
0. Detailed balance is satisfied because of the unitary action of the collision 
operator as it causes quantum mechanical superpositions of outgoing particle 
configurations at each site of the spatial lattice. Consequently, the quantum 
lattice gas is unconditionally stable as a numerical algorithm. 

We calculate the single-particle distribution function analytically for a quan- 
tum lattice-gas system at local equilibrium. The analytical prediction is that 
it has the same form as the single-particle distribution function of a classical 
lattice-gas system, which also obeys the principle of detailed balance. We var- 
ify this prediction through numerical simulation of a two-dimensional quantum 
lattice gas, which is a straightforward generalization of the classical FHP lattice 
gas PI For comparison purposes, we also include results from a classical FHP 
simulation. In the low Mach number incompressible fluid regime, there is excel- 
lent agreement between the theoretical prediction and the numerical data for 
the single-particle occupation probability. 



2 Review 



There are new possibilities and limitations that arise in computing if we use the 
principle of quantum mechanical superposition of states |4l5ltil7liSft)j . In quantum 
computing a two-level quantum bit represents the smallest unit of information 
which may be in a superposition of the discrete states |0) and |1)Q An example 
of the physical embodiment of a qubit is the z-component of a nuclear spin in 
an atom in a uniform external magnetic field 0 

An open issue for quantum computing is whether or not entangled qubit 
states (of many qubits, much more than two) within the quantum computer’s 
Hilbert manifold can be isolated from the surrounding environment to a suf- 
ficient degree for delicate quantum algorithmic steps to be completed. Using 
quantum mechanical superposition among qubit states to speedup a computa- 
tion by simultaneously encoding many possibilites, an approach termed quantum 



^ A qubit, |g) = a|0) + j3\l), has an amplitude, a, of it being in the zero state, |0), and 
another amplitude, /3, of Ending it in the one state, \ 1) . The probabilities add to unity: 
(0|0) -I- (1|1) = {q\q) so the complex coefficients are constrained by |ap -I- |/3p = 1- 
^ Cory et al. have employed the quantum number of a nuclear spin of an atom in 
a molecule of a liquid placed in a strong external magnetic field to encode a single 
qubit and they used nuclear magnetic resonance to control its state and interaction 
with qubits in neighboring atoms within the same molecule IH3- 




36 



J. Yepez 



parallelism, is generally considered the primary virtue of quantum computation!^ 
Yet uncontolled coupling with the surrounding environment causes decoherence 
of the qubit states and the virtue is lost — quantum parallelism levies a high price 
for coherence of the quantum computer’s wave function. This has spured the de- 
velopment of scalable quantum error correction techniques, considered crucially 
important for the enterprise to continue I11ISI12I13I . Because of the difficulties 
of quantum coherence, the first quantum computer comprised only two qubits. 

An historical starting point that led to quantum computing was reversible 
computing M- Since microscopic physics is reversible^, it is believed that quan- 
tum mechanical algorithms must be too.0 Reversible algorithms for simulating 
physics on a quantum device can serve as a guide for constructing the device. The 
common assumption is the quantum mechanical device itself undergoes unitar 
(and therefore reversible) evolution as it transitions through its “computation’ 

For any reversible computation, one can describe the algorithm by specifying 
a unitary evolution operator, formally written as acting on the system 

wavefunction, |!F), which constitutes the state of the quantum computer’s “mem- 
ory”. With N qubits, the quantum state |<F) resides in an exponentially large 
Hilbert space with 2^ dimensions. A new quantum state, is generated by 
application of a unitary matrix of size 2^ x 2^ as follows 






JHr/h 



m- 



( 1 ) 



By repeated application of an ordered sequence of states is generated and 

each one is given a unique time label. If the first state is labeled by t then the 
next one is labeled by f -|- r, and the next by f -|- 2r, and so forth. With this 
understanding we write as 

|iZ/(t + r)) (2) 



In this way the computational time advances incrementally in unit steps of du- 
ration r. Of course the state of the quantum computer exists at all intermediate 
times, say at f-l- but for our purposes we need only use the state at intervals of 
the time step r. The quantum computer’s evolution is invertible by application 
of the adjoint of the evolution operator 

I iF(t-r)) I iF(t)). (3) 

® In lattice-gas quantum computation, quantum parallism is used to allow for simul- 
taneous multiple collision possibilities at each site of the lattice. This allows for a 
reduction in the viscosity of the fluid which improves the computational efficiency. 
Yet the computational efficiency is also due to the continuous phase of the qubit, 

I q) = cos 8\ 1) -hsin 0| 0) , which we use to represent the probability of finding a particle, 
fa = cos^ 9. 

^ Provided photons do not escape to infinity. 

® In this paper we use irreversibility, for practical purposes, in part of the quantum 
mechanical algorithm. 

® By restricting oneself to reversible algorithms, in principle heat production may be 
avoided altogether 
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This computational picture is consistent with the Schroedinger equation of quan- 
tum mechanics. For any reversible algorithm chosen, the task is to map the com- 
putational evolution operator of the algorithm onto the dynamical evolution of 
interacting qubits of the physical device. 



3 Quantum Lattice Gas 

Lattice-gas quantum computation uses the superposition of multiple qubit states 
within a small spatial region of size i only for a short amount of time on the 
order of the duration of a single time step, r. A lattice-gas quantum computer 
has qubits arranged in a lattice based array with a small group of qubits at each 
site of the lattice. Each site of the lattice can be thought of as a small quantum 
computer and all the quantum computers are connected in a lattice network. 
The quantum lattice gas’ evolution can be formally expressed as a special case 
of (0) as follows 



I !f(a;i,...,a;v;t-hT)) = SC \ 'F{xi, . . . ,xv;t)), (4) 

where S' is a unitary streaming matrix and (7 is a unitary collision matrix and 
where we have explicitly labeled the wavefunction’s dependence on all the co- 
ordinates of the lattice. The operator C causes mixing of the outgoing collision 
configuration at each site of the lattice, locally entangling the qubit states within 
a lattice cell of radius size, £. The operator S causes qubits to move from one 
site to the next, by exchanges between nearest neighboring sites (it is identical 
to its classical counterpart). Each qubit moves with unit speed, c = ^, along 
one of the lattice directions, Ca- Hence, in a completely coherent quantum com- 
putation, the application of S causes global entanglement of the all the qubit 
statetQ. It remains an intractable problem to theoretically analysize the dynam- 
ics of a quantum computer with many qubits because of the exponentially large 
size of the Hilbert space in which the entanglement occurs. And to make mat- 
ters worse, even if a quantum computer was constructed with a large number of 
qubits, its wavefunction would decohere by uncontrolled entanglement with the 
external world and we know of no way to mitigate against this. So constructing 
a large coherent quantum computer is difficult, if not all together impossible, 
and predicting its behavior by analytical means is intractable. 

So what can be done about this? What I would like to consider is a simplifi- 
cation that will sidestep these obstacles and give us two important advantages: 
(1) a simple way to use a quantum computer with a large number of qubits; 
and (2) a way to analysis of its behavior. In lattice-gas quantum computation 
complete coherence of the wavefunction is not needed for the algorithm to work. 
In fact, we assume entanglement of qubit states is only among small clusters of 
qubits in a localized nearby neighborhood, so independent quantum operations 

^ Mathematically speaking, this is because both S and C cannot be simultaneously 
diagonalized 
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are done in a classically parallel fashion on all sites simultaneously. This is the 
collision step. 

In a deterministic classical lattice gas, the collision operator is a permutation 
matrix with components being either zero or one. In a probabilistic classical 
lattice gas, the collision operator is a transition matrix with real valued com- 
ponents. In contrast, in a quantum lattice gas, the collision operator can be a 
unitary matrix with complex components. The collision process is in general 
irreversible because a projection of the quantum computer’s wavefunction into 
a tensor product state over the qubits is periodically made causing the wave- 
function to partially collapse. Hence, application of S does not cause any global 
entanglement. 

The quantum lattice gas presented here should not be confused with previ- 
ous quantum lattice gas models by Sued ^7|, Boghosian or Yepez |0| for 
simulating quantum mechanical systems. Despite some similarities, the type of 
quantum lattice gas treated in this paper is a direct generalization of a classi- 
cal lattice gas with quantum bits replacing classical bits. In fact, if othogonal 
permutation matrices with 0 and ±I components are used for the collision pro- 
cess and in the limit of complete collapse of the lattice-gas quantum computer’s 
wavefunction, the quantum lattice gas exactly reduces to a classical lattice gas. 
This particular feature distinquishes the quantum lattice gas for fluid simulation 
from the quantum lattice gases for quantum mechanical simulation. 



Table 1. Model Constants 



Constants 


Names 


1 


length unit 


T 


time unit 


m 


mass unit 


c 


velocity unit 


D 


spatial dimension 


B 


lattice coordination number 


G-ai 


unit lattice vectors 


a 


directional index (1,2,. . . , B) 


i, j, k, 1 


spatial indices 



4 Preliminaries 

Consider a lattice-gas quantum computer with the following properties: 

— M is the number of lattice sites 

— B is the number of qubits per site (and the number of nearest neighbors) 

— N = VB is the total number of qubits 

— 2^ is the size of the full Hilbert space 
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Table 2. Wavefunction Symbols 



Symbol 


Size of Manifold 


Description 


W 


2iv 


Total system wavefunction 




2® 


On-site ket 


UJ 


B 


Partially collapsed on-site amplitudes 


q 


2 


Qubit ket 




Fig. 1. An array of small quantum computers (the quantum computers are depicted 
as circles) arranged in a 2-dimensional triangular lattice (B=6). The large circle on the 
right is an expanded view of a single quantum computer which is one site of the lattice. 
It depicts the on-site submanifold, TL. Each quantum computer at a lattice node has 6 
qubits so the on-site ket | i/>) resides in a 64-dimensional Hilbert space. Each node is 
coupled to its 6 nearest neighboring quantum computers by a mechanism allowing for 
the exchange of a single qubit. 



— 2^ is the size of the on-site submanifold, denoted Ti, 

— B is the size of the reduced on-site submanifold, denoted B 

We will use the following convention for indices: 

— Small roman letters (a, 6, c) for the K-space dimensions, a G {1, . . . , B} 

— Greek letters (a, /3, 7) for the 7f-space dimensions, a G {0, . . . , 2^ — 1} 

— Middle roman letters (i, j, k) for the spatial dimensions, i G {1, . . . , D} 

The full Hilbert space of size 2^^ is partitioned into V independent quantum 
manifolds of dimension 2^, as depicted in FigH Quantum superposition of states 
occurs only within each 2^-dimensional subspace, denoted Ti.. A general on-site 
ket defined over the basis states of Ti, is the following 
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tpa{x,t) I a) = 



q ;=0 



/ V'o \ 
V'l 

\V'2®-l/ 



( 5 ) 



The ket | 'tp) is specified by 2^ complex amplitudes, denoted ipQ , The 
quantum computer’s total wavefunction is formed as a tensor product over all 
the 7f-manifolds 

y 

\W{xi,...,xv]t)) = ^\ il}{x,t)). (6) 

X — 1 

The collision operator, C, is blocked over all the 7f-manifolds. That is, the coll- 
sion matrix is block diagonal with V blocks each of size 2^ x 2^, and therefore 
can be written as a tensor product 



y 

C = (g)C/. (7) 

X — 1 

The on-site collision matrix, U, is unitary and acts on the on-site ket 

\pj\x,t)) = IJ \pj{x,t)). ( 8 ) 

The prime on the L.H.S. of 10) indicates that the ket is an outgoing collisional 
state. 

5 Unitary Collision Matrix 

Let Qa be matrices representing the conserved quantities in the single speed 
quantum lattice gas, Qa = (Qo,Qi) where i is an index over the independent 
spatial coordinates. A fundamental property of a quantum lattice gas is that the 
mass density and the momentum density can be written as follows 

p = {ip \ Qo \ Ip) (9) 

pv^ = {ip\Q, \ ip), (10) 

that is, where the component of Qo are 

B 

a—1 

and where the component of Qi are 



B 

a—1 



( 12 ) 
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The bf^a denotes the a“*-bit of the ket. The Ba here denote the unit lattice 
vectors where a = The components of the 2^ x 2^ qubit number 

operator are defined by 

In terms of C3), the operators for mass and momentum are 



B 

Qo = m'^fla (14) 

a—1 



and 



B 

Qi = mcy^egihg. 

a—1 



(15) 



In terms of m the invariant quantities are simply expressed as the following 
matrix elements 



B 

P = ^to(V' I ha I lA) (16) 

a—1 

B 

pvi = y^^mcegi{ip I fig I Ip). (17) 

a—1 

The matrix elements (jOI) and m must remain constant after each time step 
iteration 

{ip{t + t) \Qa\ ip{t + r)) = {ip{t) I Qa I ip{t)). (18) 

Since | tp{t + r)) = [/ | ip{t)), this implies that 

U^Qc.U = Qc, (19) 

which is just the commutator 

[U,Q^] = 0. (20) 

The matrices Qa must commute with U. 

Let g denote the generator of U 

( 21 ) 

where e is an “Euler angle” . Consider a “rotation” through an infinitesimal angle 
e so that C/ can be Taylor expanded to first order as 



U =\ + ieg. 



( 22 ) 



The unitary condition, WU = 1 , implies that the generator is hermitian 



g- 9^ =0 + C>(e^). 



(23) 



42 



J. Yepez 



From ll I ^)ll . we see that mass and momentum conservation is ensured provided 

Qa5-5^Qa = 0 + O(£2). (24) 

The solution of the set of linear equations (filtll and (I24|l give the Lie algebra for 
the unitary group. Therefore, the mass density (EJ and the momentum density 
m are conserved when each equivalence class block of the collision operator is 
an element of the unitary group U{n) where n is the size of the equivalence class 
of the incoming local configuration. This is an important feature of a quantum 
lattice gas. Since any member of the unitary group can be used, the quantum 
lattice gas is algorithmically robust. 




Fig. 2. The equivalence classes for the quantum FHP lattice gas. 



An equivalence class is defined as a set of basis states that correspond to 
particle configurations with the same mass and momentum. The unitary collision 
operator, U, acting on the 2^ dimensional 7f-manifold itself is block diagonal 
over the equivalence classes. For example, there are two equivalence classes for 
the FHP lattice gas [ 2 ], see Fig0 The first equivalence class is comprised of the 
following two-body kets 



|9) = 1001001) 
|18) = 1010010) 
|36) = jlOOlOO) 
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with mass, m = 2, and zero momentum, p = 0. A general ket in this mass- 
momentum sector of the on-site manifold is a linear combination of these 

a|100100) -k /3|010010) -k 7IOOIOOI), (25) 

where a, j3, and 7 are complex numbers. The second equivalence class is com- 
prised of the following three-body kets 

| 21 ) = 1010101 ) 

|42) = jlOlOlO) 

with mass, m = 3, and zero momentum, p = 0. A general ket in this 
mass-momentum sector is a linear combination of these 

p|101010)-ki^|010101). (26) 

So U for a two-dimensional quantum lattice gas on a triangular lattice has two 
blocks, a U (3) block for mixing the 2-body configurations and a U (2) block for 
mixing the 3-body configurations. 

For the triangular quantum lattice gas, we have 

e*^cos?7 sinyy \ / ^>21 \ 
e~'^^ cost] J \1p42 J ’ 

where zero momentum three-body configurations are mixed by a unitary matrix, 
U{2) = [/(I) 0 SU{2), which in general has four free parameters. The zero 
momentum two-body configurations are mixed by a unitary matrix, C/(3) = 
U{1) 0 S'[/(3), which in general has nine free paramter^ 

f ^'9 \ ^ ( ^9 \ 

Uis = e*^5C/(3) U18 . (28) 

W 3 J VV'36/ 



6 Partial Collapse of Post-Collision Ket | ?/?') 



To avoid causing any global entanglement as induced by streaming, we project 
the post-collision ket | ip') which resides in the 2^-dimensional H manifold onto 
a smaller B-dimensional submanifold, B, using a projection operator, denoted 
r, as follows 



\u;')=r\ip') 



/^i\ 
, ,/ 



(29) 




The operator F causes a partial collapse of the locally entangled on-site state \ip) 
resulting in a nonentangled state |w) residing in a smaller manifold (a mapping 



® We do not write out the SU(3) matrix in component form because it is too compli- 
cated. 
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%2 |iiiiio> Injection 

^63 




Fig. 3. Two-dimensional quantum lattice gas on a triangular lattice. The lattice co- 
ordination number is B = 6. There are 2® = 64 amplitudes in the 7t-manifold and 
6 amplitudes in the B-manifold. Injection maps the 6 on-site amplitudes uJa in the 
B-manifold into the larger 7i-manifold. The inverse process, projection, maps the 64 
amplitudes ija in the 7t-manifold onto 6 amplitudes u>a in the B-manifold. The projec- 
tion is a measurement process that causes a partial collapse of the on-site wavefunction 
I ip{x,t)}. The partially collapsed wavefunction is | uj{x,t)). 



from 64 dimensions down to 6, see FigOJ- Thus jw) in (121)11 may be termed the 
collapsed post-collisional on-site ket. By construction, the action of F fixes the 
phase, 0a, of the on-site qubits | qa) according to the following recipe 

I Qa) = cos 9a \ 1) + sin 6»a | 0), (30) 

where u>a = cos 9a- That is, 

I Qa) = Wa I 1) + V^l - | 0). (31) 

After the collapse of the ket | tp) the single-particle occupation probability, de- 
noted fa, is a well-defined quantity. It is the probability of finding a particle at 
coordinate {x,t) with momentum mcca 



fa{x,t) = U}*a{x,t)uJa{x,t). 



(32) 



Using the single qubit number operator h 
terms of the qubit ket as 



0 

0 



, ll,42ll can be written in 



fa{x,t) = {qa{x,t) I n I qa{x,t)). (33) 

Furthermore, in (Eni) and o, the matrix element {ip \ fia \ 4’) also gives the 
probability of particle occupancy. So for a quantum lattice gas, fa can also be 
expressed as the matrix element of the multiple qubit number operatoi|3 

fa= i'ip\na\ Ip). (34) 

® In a classical lattice gas the single-particle occupation probability is obtained by 
ensemble averaging over the number variables, fa = {ua}, where = 0 or 1. 
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Inserting the expression for the outgoing collisional state (0 into the R.H.S. 
of 023, we have 

I u;') = rU I V-). (35) 

In we will use a nonlinear function for the projection operator. Our goal 
is to retain as much quantum information as possible while allowing (1551) to 
reduce to the collision equation of classical lattice gas transport when is a 
real-valued permutation matrix. This is accomplished by projecting down from 
the 7i-manifold containing 2^ complex amplitudes to the S-manifold with only 
B complex amplitudes. Each tUa (or associated qubit | qa)) is “attached” to one of 
the lattice directions. The reason for this reduction of the quantum information 
is that by using only B complex amplitudes, one for each direction, we can 
straightforwardly write down a quasi-classical streaming equation in analogy to 
the streaming equation of a classical lattice gas 

\u>{x + iCa,t + r)) =\u>'{x,t)), (36) 

where Labi = ^aiSab- Inserting into (E3, we have 

\uj{x + iCa,t + T)) = ru\ip{x,t)). (37) 

The only task remaining to complete the analogy to classical lattice gas dynamics 
is to rewrite the R.H.S. of (1371 solely in terms of the w’s. This can be done by 
injecting the on-site collapsed ket | oj) residing in the sub-manifold B up into 
the larger on-site manifold Ti. (see Fig0. This process can be expressed by 
application of an injection operator, I, as follows 

\ip{x,t)) = i \ uj{x,t)). (38) 

A straightforward way to accomplish the injection is to take the tensor product 
over the on-site qubits 



B 

\ilj{x,t)) ^<^\qa{x,t)). (39) 

a—1 

This is a nonlinear operation^ Let us revisit the example a two-dimensional 
quantum lattice gas on a triangular lattice (R = 6), a generalization of the clas- 
sical FHP lattice gas m. Fig0 illustrates projection from the 64-dimensional 
7i-manifold down to the 6-dimensional R-manifold and illustrates injection from 
the R-manifold up to the 7i-manifold. The 6 on-site amplitudes u>a (or the as- 
sociated 6 on-site qubits | qa)) generated by the projection can be streamed in 
a classical fashion. After streaming to their new sites, each quantum computer 
has a new incoming configuration of the LOa amplitudes. Before this configuration 



10 



A non-square linear matrix conld also be used, but it is difficult to find an appropriate 
matrix even though it can be shown that one exists. 
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can be collided, they must be injected up to the larger 64-dimensional manifold 
where the collision process is well defined. 

Inserting into fTTHl gives 

\J{x,t)) = rui\uj{x,t)). (40) 



Using the fact that the projection of the injection is the identity operation: 
\ uj) = ri \ u>) , we write (141)11 in a form analogous to the classical lattice gas 
collision equation 



I =1 u}{x,t)) + 



rui I uj{x,t)) 



ri I uj{x, t)) . 



(41) 



Finally, we arrive at the quantum lattice gas microscopic transport equation by 
inserting (sa into lllioll 

I Uj{x + ita, t + r)) =\ io{x, t))+ I n{x, t)), (42) 

where the quantum lattice gas collision operator is defined as 

I f2{x,t)) = riii I io{x,t)) — ri \ oj{x,t)). ( 43 ) 

In component form, 1141^11 is 

UJa{x + iCa, t+r) = U!a{x, t) + fia {uJ*{x, t)) . (44) 

Equation (EH is identical in form to the classical lattice gas transport equation 
where the occupation variable, ria = 0 or 1, is replaced by a complex amplitude, 
0 < \oJa\ < 1, that continuously encodes the square root of the probability for 
particle occupancy. Hence (I44II is a much more useful expression of the quantum 
lattice gas dynamics than (EJ is. 



7 The Projection Operator 

We can write an analytical expression for the projection operator where the 
amplitudes Ua a nonlinear function of the amplitudes 'i/^o 






2^-1 






1 



I t/'a p baa 

a—0 



(45) 



Note that baa = 0 or 1 is the Boolean value of the bit of the 0 “* ket in the 
number representation. Let rh and pi be the operators for mass and momentum 
in the K-space. Then the matrix element for the mass density is 



p= (w I m I w). 



( 46 ) 



where rhab = and the matrix element for the momentum density is 
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pvi = {lu \pi\ w), 



( 47 ) 



where {pi)ab = mcCaiSab- Qo and Qi were defined in m to be the mass and 
momentum operators in ^-space. Here we have mass and momentum operators 
in w-space. The matrix element (0 defines the mass density as p = {ip \ Qo \ 
Ip), where {Qo)ap — rnJ2a=i^aaSap, and the matrix element m defines the 
momentum density as pVi = {ip \ Qi \ ip), where (Qi)ap = mcY^^^^baaBaiSab- 
Equating (gnj with @ and equating (EH) with gives us a way to check the 
projection operator dH- 
This is done as follows 



B 

mY^ 1 Wa p 


(48) 


a—1 




B 2® 




y] 1 P baa 


(49) 


2® / B \ 




X! 1 P 1 I , 

q:= 1 V a=l / 


(50) 



where we used the square of (03) on the second line of the derivation. Therefore, 
we have 

{uj \ m \ uj) = {ip \ Qo \ Ip), (51) 

where the mass operator in ^-space 

B 

(0o)a/9 — ^ ^ ^ ^ao^a/9 
a=l 



is identical to Qo defined in (gj. So the projection operator 03 conserves mass. 
We continue the consistency check by rewriting 03 



B 

'pj \ Pi \ Lo) = mc'Y \ p eai 

a—\ 


(53) 


B 2® 

= me y] y] 1 V'a P baa&ai 


(54) 


2® / B \ 

— ^ ^ 1 Ip a 1 f TblC ^ bgaCgi 1 , 
a=l V a=l / 


(55) 



where again we used the square of (14,411 on the second line of the derivation. 
Therefore, we have 

{uJ\p^\uJ) = {1p\Q^ \ Ip), (56) 

where the momentum operator in ^-space 
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B 

{Qi^a0 — TTIC ^ ^ 
a—1 

is identical to Qi defined in (ma). So the projection operator (I4t>ll conserves 
momentum as well as mass. 

8 Equilibrium Ansatz 

The uja amplitudes in S-space can be ordered in powers of e as follows 

|w)=|a;(°))+£|u;«) + 0(£2), (58) 

where | denotes the equilihrium ket and where the ket | is the first or- 
der correction from equilibrium. The condition equilibrium is that | satisfies 
the following identity 

I w(o)) = T[// I w(o)). (59) 

Note that the associated equilibrium ket in 7f-space, | follows from (r>iSll 

by injection \ -ip) = I \ to) 

\ tp) =\ + e \ + 0{e'^). (60) 

So we can also write as follows 

I ip(o)^ = U I ip^°'>). (61) 

It is clear that | tp^^'>) is an eigenvector of U with unity eigenvalue. Using . 
we immediately see that the collision operator vanishes at equilibrium 

I f 2 (w(°))) = rui I - ri \ = o. (62) 

Equations llPiSll and llPtill constitute the essential ansatz that will allow us to 
perform a Chapman-Enskog analysis of the quantum lattice gas. It is possible to 
analytically solve llPtill for | Knowing the form of | we can predict the 

hydrodynamics equations of the quantum lattice gas at the macroscopic scale. 
In the Chapman-Enskog analysis, we expand the collision operator, | 17), about 
this equilibrium ket | In so doing, the Jacobian of the collision operator is 

computed as a first order correction and is evaluated at | w) =| Since the 

transport coefficients for the mass diffusion, shear viscosity, and bulk viscosity 
depend on the value of this first order correction, and this in turn depends on 
the value of | one must determine the equilibrium amplitudes in order to 

compute the value of the transport coefficients. 

The a single particle occupancy probability fa = = {q^'^ \ h \ 

= {ip \ fi a \ Ip) has the functional form 



gOtp+0ea-p+-tE _j_ 2^ ’ 



( 63 ) 
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0.05 0.1 0.15 0.2 0.25 0.3 



^ _ Velocity 

^3 ^2 





O Quantum lattice gas 
X Classical lattice gas 



(b) 



Fig. 4. Theory versus simulation comparison of the velocity dependence of the single- 
particle distribution function in the non-Galilean parameterization: fa = uia^a = d + 
dDia ■ V + gdD{D/2 + l)Qa '■ w. FHP simulation data is overplotted on this predicted 
mesoscopic distribution function. Plots (a) and (b) are for background densities of 
d = .20 and d = 0.25, respectively. A velocity shift is imparted along the a;-axis; that 
is, along the fi direction indicated in the figure. Data was collected from a 128 x 128 
classical FHP simulation (crosses) and was coarse-grained averaged over 1600 time 
steps from time step t = 400 to t = 2000. Data was also collected from a smaller 
32 X 32 quantum FHP simulation (circles) and were measured at a single time step at 
t = 200. 



where the argument of the exponential is a linear combination of the conserved 
scalar quantities: ( 1 ) the mass p; ( 2 ) the momentum component Ca ■ p along 
the lattice direction Ca] and (3) the energy if at a lattice site. The real valued 
coefficients a, (3, and 7 are free parameters that are fixed by the non-Galilean 
parameterization given in Appendix El The reason for the form of Hb.’ll) is the 
collision matrix U is unitary and so the collisions obey detailed balance. 

In the quantum limit, where the quantum lattice gas becomes a fully coher- 
ent quantum system that undergoes unitary evolution, we expect the equilibrium 
probability for the particle occupancy to have the form of m- In the opposite 
limit, the classical limit, where there is a complete collapse of the ket | 'P) every- 
where, the quantum lattice gas reduces to a classical lattice gas system. In the 
classical limit too, the particle occupancies are described by ( 0 . Our quantum 
lattice gas dynamics is somewhere midway between a fully quantum system and 
a fully classical system. Since both ends of the spectrum are described by (ES) > it 
is not altogether unexpected that the middle regime is too. However, this is not 
obvious. At first glance, it appears that the destructive act of projecting from 
the 7i-manifold onto the ;B-manifold might destroy the form of the equilibrium 
distribution. But it does not (see Fig0); the distribution (ICTl describes the par- 
ticle occupancy of our quantum lattice gas 0. Taking uja = \J {q'a'’ | h | 
the result (see Appendix EJ is the followinsF^ 



In 16411 . we have used the approximation y/T+~x = 1 -|- ^, which holds for small x. 
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u>) = Vd { \ 1) + ^v- I e) + vv :| Q) 



2c 



4c^ 



where 



1) = 



and 



1 
1 
1 

\iJ 

e) = C\ 1), 



Q) = (££--) I 1). 



Taylor expanding the collision operator gives 

dila (w) 



Qa (w) = 



dujb 



CO 



( 1 ) 



0(6^). 



j(0) 



( 64 ) 



( 65 ) 

( 66 ) 

( 67 ) 

( 68 ) 



Using the equilibrium condition m, the first term on the R.H.S. vanishes and 
we are left with 

(w) = eJab^b ^ + C*(^^)) (69) 

where the Jacobian of the projection operator is defined as 

dQa (w) 



— 



dcob 



j=w(o) 



Equation (EHJ can be written in vector form as 

I f2{co))=ej\io^'^'>) + 0{e^). 



( 70 ) 



( 71 ) 



9 Boltzmann Equation for the Quantum Lattice Gas 

The quantum lattice gas microscopic transport equation J22I) is 

I co{xl + etc, t + e^r)) =| uj{x, t))+ \ f2{x, t)). 

Near equilbrium, each application of the projection operator causes only small 
changes in the phase of | to) allowing us to Taylor expand the L.H.S. of (P|). 
Taylor expanding m to second order in e gives a Boltzmann equation 

e'^dt I co{x,t)) + ^ecCidi + \ co{x,t)) + 0{e^) = ^ | f2{x,t)). 

( 72 ) 

In analogy with a classical lattice, we impose two constraints on the collision 
operator regarding the isometries of the lattice. The first constraint is 



(w I m I 17) = 0, 



( 73 ) 
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and this will be needed when we take the zeroth moment of iirzi) . This con- 
straint enforces mass conservation. The second constraint to enforce momentum 
conservation is 

(w I M f?) = 0, (74) 

and this will be needed when we take the first moment of (E2I). Inserting the 
e-expansion of | oJa), (1^ . and the e-expansion of | f 2 a), (ED, into the quantum 
lattice gas transport equation, (E2J, and keeping terms up to second order in e 
gives 

2 2 

e'^dt I -1- -Cidi I + —Lidi \ -I- ^Cit^didj \ = 

T T ZT 

-J \ + (75) 

r 

Now equating the order-e terms, we get 

J I = CA I (76) 

Then inverting the kinetic part of the Jacobian matrix gives 

I o;(i)) = CA \ (77) 

So the first order correction ket is equated to the gradient of the equilibrium ket . 
Inserting this result back into (EHI) we have 

I w) =1 u;W) + eJ-^CA I + O(e'), (78) 

Inserting (ED into (ED, we have the subsonic e-expansion of | w) good to first 
order in e 

I w) = Cd (^1 1) -k ^v- I e) -k :| Q) + : Vv \ 1)^ -k 

0{e^). (79) 



10 Hydrodynamic Equations 
ZEROTH MOMENT EQUATION: 



Now we can write the zeroth moment of the Boltzmann equation (I72II by left 
multiplying by {u>\m as follows 



g2£2 ^ 

e^{uj \m\ dtio) + e(iv \ rhcCi \ diUi) H — - — {ui \ mCiCj \ didjU>) 

2t 

The adjoint of (jiSOy is the following 



(w I TO I U). 

( 80 ) 



2«2 



e^{dtto I TO I w) -k e(9iW | time \ to) H — - — A^j^ I C-AA | w) = — (17 | to | w), 

2r T 

( 81 ) 
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and the R.H.S. will also vanishes because of constraint liY.'iil . Because m and L 
commute, adding (EHll and EH) gives us the following zeroth moment equation 

I m I ,.) + I mo£. I ,.) + ^ ((„ I ,h£.£, | a,a,^)+ 

{didjLo I CiCjih I = 0. (82) 



FIRST MOMENT EQUATION: 

Now we can write the first moment of (EJ by left multiplying by {uj\pi as follows 

e^(w I Pi I dtoj) + e{uj \ picCj \ djuj) + \ piCjCk \ djdkuj) = 0, (83) 

zr 

where the R.H.S. vanishes because of the second constraint (IY4II on the collision 
operator. Because p and C commute, adding to its adjoint equation gives 
us the following first moment equation 

e^dt{ua \pi\io) + edj{uj \ picCj \ w) + (^(w | PiCjCk \ djdkuj) + 

{dkdjUJ I CkCjPi I = 0. (84) 

The zeroth and first momentum equations lliS2ll and lliS4ll are partial differential 
equations in the matrix elements. The macroscopic equations of motion, a mass 
continuity equation and a Navier-Stokes equation, come from dH3 and (IHH), 
respectively. 

We can now determine the partial differential equations that describe the 
dynamics of a quantum lattice gas in local equilibrium. Inserting m into the 
zeroth moment equation (18211 and retaining terms up to second order in the 
smallness gives 

e^dt I +ed, \ L, \ = 

_n^ ((^(0) I I + (a,a,w(°) | cdj I , (85) 

using the fact that m = ml. Identifying the local equilibrium mass density (gni. 
p = I and momentum density (HTIl . pvi = \ Li \ we 

arrive at the hydrodynamic equation for mass flow 

e^dtp + edi {pvi) = 

—d, ((o;(°) I I | L,{J-^YU \ 

- I I didjUi^°^) + {didjUJ^°^ I LiLj \ , (86) 
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Similarly, by inserting into the first moment equation m and retaining 
terms up to second order in the smallness gives the hydrodynamic equation for 
momentum flow 

e^dt {pvi)+ = 

-eVidj I I \ \ 

2 

I Citjtk I djdkUJ^^'^) + {dkdjuj^^'^ \ Lkt^U \ , (87) 

where the ideal part of the momentum flux density tensor is defined as 

I 44 I (88) 

To obtain the macroscopic equations of motion, we have to determine the value 
of the matrix elements appearing in ®, (El) and (1^ . This is carried out in 
Appendix ini The result is that in the incompressible limit (V • v = 0), all the 
matrix elements on the R.H.S. of ijiStill vanish leaving us with a mass continuity 
equation 

dtp+ di{pvi) = (89) 

and the matrix elements on the R.H.S. of do not vanish leaving us with a 
viscous Navier-Stokes equatioi0 



dt(pvi) + dg(gpViVj) = -diP + pvd'^Vi. 



In (pH . the pressure is 




and the kinematic shear viscosity is 



e /J_ 

t{D + 2) \Kg 




(90) 



(91) 



(92) 



The form of the kinematic viscosity is identical to that for the classical 
lattice gas as found by Henon I2D]. However, in the case of the quantum lattice 
gas, the value of Kg can be different than the classical value because of quantum 
mechanical interference of outgoing collision possibilities. 



11 Summary 



In this paper we predicted the macroscopic behavior of a lattice-gas quantum 
computer. The following observations are made: 



As is the case for the classical lattice gas, there is a density dependent prefactor 
appearing in the convective term and the pressure, g{d) — ■ 
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1. The <P'-space unitary collision matrix, C, is successively blocked, first over 
the on-site 2 ^-dimensional manifold, C = (2)^=1 U. Each block U is also 
block diagonal over the equivalence classes. 

2. The projection operator, F, periodically causes a partial collapse of the on- 
site superpositions. So a quantum computer with many qubits can simulate 
a quantum lattice gas with only short-term and short-range entanglement 
and coherence of qubits. 

3. Streaming of complex amplitudes (or the associated qubits) occurs in analogy 
to streaming in a classical lattice gas and does not cause global entanglement 
in the quantum lattice gas because of the application of F. 

4. The quantum lattice gas can be understood as existing between two limits, 
a fully coherent quantum system and a classical system. And its single- 
particle distribution function, fa = |woP = {qa\n\qa) = (V’l^alV')) has the 
form fa = 1/ (exp(ap + j3ea-p + ^E) + 1). 

5. Like the lattice Boltzmann equation approach to simulate fluid dynamics, 
the quantum lattice gas is a noiseless method that directly codes the particle 
dynamics at the mesoscopic scale. However, unlike the lattice Boltzmann 
BGK collision operator, the quantum lattice-gas collision operator obeys 
detailed balance. Hence, the method is unconditionally stable. 

6. The macroscopic hydrodynamic behavior is described by a viscous Navier- 
Stokes equation. 

12 Closing Remarks 

To mimic the behavior of other physical systems, quantum lattice gases need 
many qubits. A first generation quantum computer, with a only two qubits, 
cannot test the behavior of the quantum lattice gas at the macroscopic scale. 
However, useful tests could be conducted on a network of these first genera- 
tion machines to test the practicality of the quantum lattice gas formalism. For 
example, we could test the reliability and computational speed of a network 
of quantum computers. It is reasonable to expect that the number of qubits 
will grow exponentially according to Moore’s law as various quantum computer 
designs are realized over time. 

We know from experience with classical lattice gases that even though the 
underlying microscopic dynamics is reversible, dissipative shear viscosity arises 
at the macroscopic scale — entropy increases while at the same time information 
is conserved because of microscopic reversibility. The reason for this is that 
information, initially stored in the spatial correlations of arrangement of particle 
occupancies, in time is transferred into high order particle-particle correlations. 
The same process should occur in a quantum lattice gas. We do not yet know, 
in a quantum lattice gas setting, if there is a way to block this informational 
transfer mechanism and thereby reduce dissipation at the macroscopic scale. 

Why consider constructing a quantum computer following the lattice gas 
paradigm, when a general purpose quantum computer could simulate a quantum 
lattice gas? The answer to this question is three-fold: (1) because any member of 
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the appropriate unitary group associated with an equivalence class block of the 
collision operator is sufficient for the recover of Navier-Stokes hydrodynamics at 
the macroscopic scale, so the lattice-gas quantum computer is robust; (2) short- 
term coherence among only a small number of nearby qubits is needed; and (3) 
its behavior is predictable by analytic means. 

The quantum lattice gas method can be straightfowardly applied to three- 
dimensional fluid simulations (the two-dimensional case was treated in this paper 
because of its simplicity) and also applied to model other physical systems. 
Lattice gases are a special case of cellular automata where conservations and 
detailed balance are imposed and an isotropic spatial lattice is used. With these 
few restrictions removed, the method presented in this paper represents a general 
computational system called a quantum cellular automaton. 
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A Single-Particle Distribution Function 

The single-particle distribution function has the form 

f(za) = — (93) 

+ 1 

where the natural log of the fugacity 

In Za = ap + (3ea-p + jE (94) 

is a linear combination of the conserved scalar quantities, the mass p,the mo- 
mentum component Ca ■ p along the lattice direction Ca, andthe energy E at 
a lattice site. The real numbered coefficients a, (3, and 7 are free parameters 
that we will determine. It is convenient to define the momentum and energy 
independent part of the fugacity as 



2o 



= e 



cup 



Since fa{zo) = d is the reduced density, d = we must set 



(95) 



1-d 



Zo = 



d 



(96) 
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This fixes the coefficient a. To fix the coefficients /3 and 7 , we can specify two 
moments of the single-particle distribution function as constraint conditions. We 
begin by Taylor expanding the single-particle distribution function f{za) about 

f{za) = d+ f{zo)5z + ^f"{zo){Sz‘^) H . (97) 

The derivative of / evaluated at Zo are 

^ (98) 

and 

/"(^) = ^ ^"(^°) = (99) 

SO 

f{za) = d[l — dSz + (f{Sz)‘^]. (100) 

To determine 6z, we begin by writing the fugacity in series form 



Za = Zo 



E 

. fc =0 



{Idea -pY 

k\ 



E 



{lEf 



J Lfc=0 



k\ 



( 101 ) 



In the subsonic limit, p <C me, keeping terms only to second order in the velocity, 
the fugacity becomes 



Za = Zo 



1 + fiea-p + -{Idea 



■p? 



{l + -iE) + 0{v^). 



(102) 



since p ^ v and E ^ . Then to second order in the velocity, the change in Za 

is 



< 5^0 = Za - Zo = 



l-d 



ldea-p + -{Idea ■ pf + lE 



+ 0{v'^) (103) 



and the square of the change is 



{SZaY = 



l-d 



0^ {ea ■ pf + O{v0. 



(104) 



Inserting the expressions for 5z and {Sz)"^ into the Taylor expansion of f{za) we 
have 



f{Za)=dl^l-{l-d) 

= d 



(dea-p + ]^{fdea ■ pY + jE 



-k (1 - dY {Ca ■ pY 



1 - (1 - d) {Pca ■ p + lE) -h i(l - d)(l - 2d)0^ {ca ■ pY 



. (105) 



We have the freedom to choose the coefficients [3 and 7 to parameterized the 
distribution function as we see fit to satisfy any two constraints. Consider a 
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parameterization that fixes the value of the coefficients P and 7 by using the 
following moments for the mass density and momentum density 



II 


(106) 


a—1 

B 

pv = me y^eg/g. 


(107) 



a—1 



The parameterization may be termed the non- Galilean parametrization. Con- 
straints H I and (I I l)7|l are typically used in the formulation of classical lattice 
gases. The single particle distribution function using this non-Galilean param- 
eterization was first found in the mid 1980’s by the US researchers Wolfram 
and Hasslacher and by the French researchers Frisch, d’Humieres, Lallemand, 
Pomeau, and Rivet m- Their derivation of CJ2) given below is different then 
the derivation presented in this section; they used only two free coefficients in 
the expression for the fugacity, one for the mass and the other for the momen- 
tum; whereas we use three free coefficients. The reason for using only two free 
parameters is that in the standard single-speed classical lattice-gas construction, 
the energy is degenerate with the mass, so it was deemed unnecessary to keep a 
separate free coefficient for the energy. Using lllUtiil and (iiuvi as constraint equa- 
tions gives us a non-unity density-dependent prefactor in the convective term in 
the hydrodynamic flow equation. 

Inserting (tTOBI) into 003), the odd term in the distribution function expansion 
survives the first moment sum over lattice directions; the odd term is the one 
linear in the momentum. This fixes the value of /3 to be 



P 



D 

1-d 



(108) 



so the distribution function becomes 



fa = d 



1 -I- Dea ■ p + 



D^l-2d 

1-d 



(ea • pf + (1 - d)^E . 



(109) 



Inserting (imnii into (ITTO - all the even terms that survive the sum over lattice 
directions must add to zero. This fixes the value of 7 as follows 



Dl-2d 
~2 1-d 



(1 - d)jE = 0 



or 



■jE = 



D 



1 — 2d 
(i-d)2y- 



Therefore, the non-Galilean parameterized distribution function is 



( 110 ) 

( 111 ) 



1 -j- ECaiPi T 



D{D + 2) 



^{d^Q aijPiPj 



fa = d 



2 



( 112 ) 
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where the density dependent prefactor g{d) is defined 



g{d) = 



D 1 - 2d 
D+2 1-d 



and the traceless second-rank tensor Qa is defined 



Qaij — ^ai^aj 



D 



(113) 



(114) 



Qa is an isotropic symmetric tensor. This mass-energy degeneracy leads to an 
anomalous description of the lattice-gas fiuid’s behavior. Let us see why. The 
second moment of flTT^ gives the momentum flux density 



B 

mc^ y] eaiBajfa = PSij + gpViVj. (115) 

a—1 



The density-dependent prefactor g appears in the nonlinear convective term, 
so this parametrization does indeed give rise to non-Galilean fluid flow. The 
pressure in (ms!) has a spurious quadratic velocity dependence 



P 



= pc 



1 

S 




(116) 



B Determination of the Matrix Elements 



For the triangular lattice, tensors made up of products of the lattice vectors are 
symmetric and isotropic p. We have the following identities 



(1 1 A 1 1 ) = 0 


(117) 


1 1 CiCj 1 1 ) = 


(118) 


(1 1 C^£,£k 1 1 ) = 0 


(119) 


— _|_ 2^ {dijSki + S^kSji + S^iSjk) 


(120) 



The Jacobion of the collision operator is circulant. Its eigenvectors correspond- 
ing to the nonzero eigenvalues span the kinetic space, which contains a viscous 
subspace characterized by a degenerate eigenvalue, denoted by Krj The eigen- 
vectors in the viscous subspace are CiCj \ 1 ), for i ^ j. Therefore, we have 



I 1) = I 1), 



( 121 ) 



or inverting this over the kinetic viscous modes 

I 1) = — I 1). 

hirj 



(122) 
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Using these identities along with the epsilon expansion of | w), we can work 
out the value of the matrix elements which appear in the mass and momentum 
hydrodynamic equations of the quantum lattice gas at the macroscopic scale. 
We have 



dD 



I \ = d{l \ £,£, | 1) + — (1 | £,£^£^ \ \)vk + 



gdD{D + 2) I - % ) I \)vkvi 



2c2 



D 



gdB 



" D " 2c2 
dB 



{SijSki + SikSji + SilSjk) VkVi — 



gd{D + 2)B , 
2Dc^ ‘ 



^ViVj 



= -jy ( 1-5— • 



(123) 






(0) 



£ £ 



= 0 . 



1 I £i£j£k I l)didjVk 



(124) 



(..<"> I c.c,c, I = !g(l I C,CtC,C, I 



dB 



2c{D + 2) 
dB 

2c{D + 2) 



{2d^dkVk + d‘^v^) . (125) 



(u;(°) I £^J-^£j I = ^(1 I £^J~^£j£k \ l)djVk 



dD 

2ck„ 



(1 I £i£j£k I l)djVk 



= 0 . 



(126) 



(o;(o) I £,£jJ-^£k I = ^(1 | £,£,J~^£k£i \ l)3kVi 



dP 

2cKn 



;i I £i£j£k£i I l)dkvi 



dB 

2cKri{D + 2) 

dB 

2cKri{D + 2) 



{SijSki dikSji diidjk) dkVi 

{dkVkSij + diVj + djVi) . (127) 
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Abstract. We consider the communication complexity of the binary in- 
ner product function in a variation of the two-party scenario where the 
parties have an a priori supply of particles in an entangled quantum 
state. We prove linear lower bounds for both exact protocols, as well as 
for protocols that determine the answer with bounded-error probability. 
Our proofs employ a novel kind of “quantum” reduction from a quan- 
tum information theory problem to the problem of computing the inner 
product. The communication required for the former problem can then 
be bounded by an application of Holevo’s theorem. We also give a spe- 
cific example of a probabilistic scenario where entanglement reduces the 
communication complexity of the inner product function by one bit. 



1 Introduction and Summary of Results 

The communication complexity of a function / : {0, 1}" x {0, 1}" ^ {0, 1} is de- 
fined as the minimum amount of communication necessary among two parties, 
conventionally referred to as Alice and Bob, in order for, say, Bob to acquire 
the value of f{x,y), where, initially, Alice is given x and Bob is given y. This 
scenario was introduced by Yao m and has been widely studied (see for a 
survey). There are a number of technical choices in the model, such as: whether 
the communication cost is taken as the worst-case (x,y), or the average-case 
(x, y) with respect to some probability distribution; whether the protocols are 
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deterministic or probabilistic (and, for probabilistic protocols, whether the par- 
ties have independent random sources or a shared random source); and, what 
correctness probability is required. 

The communication complexity of the inner product modulo two (IP) func- 
tion 



IP{x,y) = xiyi + X 2 V 2 -\ h mod 2 (1) 

is fairly well understood in the above “classical” models. For worst-case inputs 
and deterministic errorless protocols, the communication complexity is n and, for 
randomized protocols (with either an independent or a shared random source), 
uniformly distributed or worst-case inputs, and with error probability ^ — S 
required, the communication complexity is n — 0(log(l/(5)) 0 (see also PI). 

In 1993, Yao P) introduced a variation of the above classical communica- 
tion complexity scenarios, where the parties communicate with qubits, rather 
than with bits. Protocols in this model are at least as powerful as probabilistic 
protocols with independent random sources. Kremer H2] showed that, in this 
model, the communication complexity of IP is I2(n), whenever the required cor- 
rectness probability is 1 — e for a constant 0 < e < ^ (Kremer attributes the 
proof methodology to Yao) . 

Cleve and Buhrman [B| (see also [^) introduced another variation of the 
classical communication complexity scenario that also involves quantum infor- 
mation, but in a different way. In this model, Alice and Bob have an initial 
supply of particles in an entangled quantum state, such as Einstein-Podolsky- 
Rosen (EPR) pairs, but the communication is still in terms of classical bits. They 
showed that the entanglement enables the communication for a specific problem 
to be reduced by one bit. Any protocol in Yao’s qubit model can be simulated 
by a protocol in this entanglement model with at most a factor two increase 
in communication: each qubit can be “teleported” 0 by sending two classical 
bits in conjunction with an EPR pair of entanglement. On the other hand, we 
are aware of no similar simulation of protocols in the entanglement model by 
protocols in the qubit model, and, thus, the entanglement model is potentially 
stronger. 

In this paper, we consider the communication complexity of IP in two scenar- 
ios: with prior entanglement and qubit communication; and with prior entangle- 
ment and classical bit communication. As far as we know, the proof methodology 
of the lower bound in the qubit communication model without prior entangle- 
ment m does not carry over to either of these two models. Nevertheless, we 
show I2(n) lower bounds in these models. 

To state our lower bounds more precisely, we introduce the following notation. 
Let / : {0, 1}” X {0, 1}" — > {0, 1} be a communication problem, and 0 < e < i. 
Let Qlif) denote the communication complexity of / in terms of qubits, where 
quantum entanglement is available and the requirement is that Bob determines 
the correct answer with probability at least 1 — e (the * superscript is intended to 
highlight the fact that prior entanglement is available). Also, let C'*(/) denote 
the corresponding communication complexity of / in the scenario where the 
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communication is in terms of bits (again, quantum entanglement is available 
and Bob is required to determine the correct answer with probability at least 
1 — e). When e = 0, we refer to the protocols as exact, and, when e > 0, we refer 
to them as bounded-error protocols. With this notation, our results are: 



Q*o{IP) = \n/2^ 


(2) 


QUIP) > i(l-2£)2n- i 


(3) 


C*{IP)=n 


(4) 


C*{IP) > max(i(l — 2£)^, (1 — 2e)^)n — ^ 


(5) 



Note that all the lower bounds are fi{n) whenever e is held constant. Also, these 
results subsume the lower bounds in ca, since the qubit model defined by Yao 
na differs from the bounded-error qubit model defined above only in that it 
does not permit a prior entanglement. 

Our lower bound proofs employ a novel kind of “quantum” reduction between 
protocols, which reduces the problem of communicating, say, n bits of informa- 
tion to the IP problem. It is noteworthy that, in classical terms, it can be shown 
that there is no such reduction between the two problems. The appropriate cost 
associated with communicating n bits is then lower-bounded by the following 
nonstandard consequence of Holevo’s theorem. 

Theorem 1: In order for Alice to convey n bits of information to Bob, where 
quantum entanglement is available and qubit communication in either direction 
is permitted, Alice must send Bob at least \n/2\ qubits. This holds regardless of 
the prior entanglement and the qubit communication from Bob to Alice. More 
generally, for Bob to obtain m bits of mutual information with respect to Alice’s 
n bits, Alice must send at least |"m/2] qubits. 

A slight generalization of Theorem 1 is described and proven in the Appendix. 

It should be noted that, since quantum information subsumes classical infor- 
mation, our results also represent new proofs of nontrivial lower bounds on the 
classical communication complexity of IP, and our methodology is fundamen- 
tally different from those previously used for classical lower bounds. 

Finally, with respect to the question of whether quantum entanglement can 
ever be advantageous for protocols computing IP, we present a curious proba- 
bilistic scenario with n = 2 where prior entanglement enables one bit of commu- 
nication to be saved. 

2 Bounds for Exact Qubit Protocols 

In this section, we consider exact qubit protocols computing IP, and prove 
Eq. 021). Note that the upper bound follows from so-called “superdense coding” 
1^: by sending \n/2'\ qubits in conjunction with [n/2] EPR pairs, Alice can 
transmit her n classical bits of input to Bob, enabling him to evaluate IP. For 
the lower bound, we consider an arbitrary exact qubit protocol that computes 
IP, and convert it (in two stages) to a protocol for which Theorem 1 applies. 
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For convenience, we use the following notation. If an m-qubit protocol con- 
sists of mi qubits from Alice to Bob and m 2 qubits from Bob to Alice then we 
refer to the protocol as an (mi, m 2 )-qubit protocol. 



2.1 Converting Exact Protocols into Clean Form 



A clean protocol is a special kind of qubit protocol that follows the general spirit 
of the reversible programming paradigm in a quantum setting. Namely, one in 
which all qubits incur no net change, except for one, which contains the answer. 
In general, the initial state of a qubit protocol is of the form 



\yi 



, y„) |0, . . . , 0) \^ba) |xi, . . . , a:„) |0, . . . , 0), 



( 6 ) 



Bob’s qubits 



Alice’s qubits 



where \<Pba) is the state of the entangled qubits shared by Alice and Bob, and the 
|0, . . . ,0) states can be regarded as “ancillas”. At each turn, a player performs 
some transformation (which, without loss of generality, can be assumed to be 
unitary) on all the qubits in his/her possession and then sends a subset of these 
qubits to the other player. Note that, due to the communication, the qubits 
possessed by each player varies during the execution of the protocol. At the end 
of the protocol, Bob measures one of his qubits which is designated as his output. 

We say that a protocol which exactly computes a function /(x, y) is clean if, 
when executed on the initial state 



\z)\yi, . . . ,?/n)|0, . . . , 0)\<Pba)\xi, . . . ,x„)|0, . . . , 0), (7) 

results in the final state 



1^ + f{x,y))\yi, . . . ,2/„)|0, . . . , 0)\^ba)\xi,. . . ,x„)|0, . . . , 0) (8) 

(where the addition is mod 2). The “input”, the ancilla, and initial entangled 
qubits will typically change states during the execution of the protocol, but they 
are reset to their initial values at the end of the protocol. 

It is straightforward to transform an exact (mi, m 2 )-qubit protocol into a 
clean (mi -|- m 2 , mi -|- m 2 )-qubit protocol that computes the same function. To 
reset the bits of the input, the ancilla, and the initial entanglement, the protocol 
is run once, except the output is not measured, but recorded and then the pro- 
tocol is run in the backwards direction to “undo the effects of the computation” . 
The output is recorded on a new qubit of Bob (with initial state ji;)) which is 
control-negated with the output qubit of Bob (that is in the state \f{x,y))) as 
the control. Note that, for each qubit that Bob sends to Alice when the protocol 
is run forwards, Alice sends the qubit to Bob when run in the backwards direc- 
tion. Running the protocol backwards resets all the qubits — except Bob’s new 
one — to their original states. The result is an (mi -I- m 2 , mi -I- m 2 )-qubit protocol 
that maps state (01 to state (Q. 
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2.2 Reduction from Communication Problems 

We now show how to transform a clean (mi + m 2 , mi + TO 2 )-qubit protocol that 
exactly computes IP for inputs of size n, to an (mi+m 2 , mi+m 2 )-qubit protocol 
that transmits n bits of information from Alice to Bob. This is accomplished in 
four stages: 

1. Bob initializes his qubits indicated in Eq. o with z = 1 and yi = ■ ■ ■ = 

Vn — 0- 

2. Bob performs a Hadamard transformation on each of his first n + 1 qubits. 

3. Alice and Bob execute the clean protocol for the inner product function. 

4. Bob again performs a Hadamard transformation on each of his first n + 1 
qubits. 

Let \Bi) denote the state of Bob’s first n + 1 qubits after the stage. Then 

(9) 

l)“|a)|6i,...,6„) (10) 

l)“|a + biXi + • • • + bnXn)\bl, . . . , bn) 

(11) 
( 12 ) 

where, in Eq. dm, the substitution c = a + &ia:i + • • • + bnXn has been made (and 
arithmetic over bits is taken mod 2). The above transformation was inspired by 
the reading of (see also |S|). 

Since the above protocol conveys n bits of information (namely, x\, . . . ,Xn) 
from Alice to Bob, by Theorem 1, we have toi + m 2 > n/2. Since this protocol 
can be constructed from an arbitrary exact (mi, m 2 )-qubit protocol for IP, this 
establishes the lower bound of Eq. 0. 

Note that, classically, no such reduction is possible. For example, if a clean 
protocol for IP is executed in any classical context, it can never yield more than 
one bit of information to Bob (whereas, in this quantum context, it yields n bits 
of information to Bob) . 

3 Lower Bounds for Bounded-Error Qubit Protocols 

In this section we consider bounded-error qubit protocols for IP, and prove 
Eq. (0. Assume that some qubit protocol P computes IP correctly with prob- 
ability at least 1 — e, where 0 < e < Since P is not exact, the constructions 
from the previous section do not work exactly. We analyze the extent by which 
they err. 

First, the construction of Section 2.1 will not produce a protocol in clean 
form; however, it will result in a protocol which approximates an exact clean 



|Ri) = |l)| 0 ,..., 0 ) 

152) = -7^ ^ (- 

153) = 77^ E (- 

a,bi,...,b„G{0,l} 

E (- 

IR4) = |l)|xi,...,a:„). 





TT 


1 








1 




\/2” 


+ 1 
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protocol (this type of construction was previously carried out in a different con- 
text by Bennett et al. 0). 

Denote the initial state as 

|j/i, . . . ,y„)|0, . . . , , x„)|0, . . . , 0). (13) 

Also, assume that, in protocol P, Bob never changes the state of his input qubits 
|?/i, . . . , yn) (so the first n qubits never change). This is always possible, since he 
can copy yi, . . . , into his ancilla qubits at the beginning. After executing P 
until just before the measurement occurs, the state of the qubits must be of the 
form 

a|yi, . . . ,y„)|x • y)|J) -f /3|yi , . . . ,yn)\x^)\K), (14) 

where \a\^ > (1 — e) and |/3p < e. In the above, the n-!-!®* qubit is the designated 
output, X ■ y denotes the inner product of x and y, and x ■ y denotes the negation 
of this inner product. In general, a, /3, | J), and \K) may depend on x and y. 

Now, suppose that the procedure described in Section 2.1 for producing a 
clean protocol in the exact case is carried out for P. Since, in general, the answer 
qubit is not in the state \x ■ y ) — or even in a pure basis state — this does not 
produce the final state 

k + a: • y) |yi, . ■ . , y«) |0, . . . , Q)\<^ba) |xi, . . . , |0, . . . , 0) . (15) 

However, let us consider the state that is produced instead. After introducing 
the new qubit, initialized in basis state \z)^ and applying P, the state is 

\z) (a|yi, . . . ,y„)|x • y)| J) -h /3|yi, . . . , y„)|x^)|AT)) . (16) 

After applying the controlled-NOT gate, the state is 

a\z + X ■ y)\yi, . . . , y„)|x • y)| J) + f3\z + W^)\yi , . . . , y„)|x^)|Ar) 

= a\z + x- y)|yi, . . . ,y„)|a; • y)| J) + P\z + x ■ y)|yi, . . . , y„)|x^)|AT) 

-P\z + x-y)\yi,.. .,yn)\x^)\K) + (3\z + x^)|yi, . . .,yn)\x^)\K) 

= \z + x-y) (a|yi, . . . ,y„)|x • y)| J) -h /3|yi, . . . ,y„)|x^)|AT)) 

+ + \yi^--->yn)\^)\K). (17) 

Finally, after applying P in reverse to this state, the final state is 
\z + x- y)|yi, . . . ,y„)|0, . . . ,0)\^ba)\xi, . . . ,a:„)|0, . . . ,0) -h (18) 

where 

+ y)) ■ • ■ > 2 /n)|S^)|AT). (19) 

Note that the vector V2P\Mx,y,z) is the difference between what an exact 
protocol would produce (state (1 1 61) 1 and what is obtained by using the inexact 
(probabilistic) protocol P (state llisll 'l. There are some useful properties of the 
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\^x,y,z) states. First, as y G {0, 1}” varies, the states \Mx^y^z) are orthonormal, 
since |yi, . . . ,t/n) is a factor in each such state (this is where the fact that Bob 
does not change his input qubits is used). Also, \Mx,y,o) = since only 

the (-^1-2 + 5T^) — -^\z + X ■ y)) factor in each such state depends on z. 

Call the above protocol P. Now, apply the four stage reduction in Section 
2.2, with P in place of an exact clean protocol. The difference between the state 
produced by using P and using an exact clean protocol first occurs after the 
third stage and is 



/2"+i 






vi,...,y„,ze{0,l} 

= ^2n+l (|Afx,y,o) ~ \Mx,y,l)) 

J/l,....yn6{0,l} 

“ Vl" ^y\Mx,y,o) , 



( 20 ) 



which has magnitude bounded above by 2y/e, since, for each yG {0, 1}”, |/3yP <e, 
and the \Mx,yfl) states are orthonormal. Also, the magnitude of this difference 
does not change when the Hadamard transform in the fourth stage is applied. 
Thus, the final state is within Euclidean distance 2y/e from 



|l)|a:i, . . . ,x„)|0, . . . ,0)\<Pba)\xi, ■ . . ,a:„)|0, ... ,0). (21) 

Consider the angle 9 between this final state and (ED- It satisfies sin^ 9 + {1 — 
cos 9)^ < 4e, from which it follows that cos 9 > 1 — 2e. Therefore, if Bob mea- 
sures his first n -|- 1 qubits in the standard basis, the probability of obtaining 
|1, xi, . . . , Xn) is cos^ 9 > {I — 2e)^. 

Now, suppose that xi, ... ,Xn are uniformly distributed. Then Fano’s inequal- 
ity (see, for example, |0|) implies that Bob’s measurement causes his uncertainty 
about Xi, . . . , x„ to drop from n bits to less than (1 — (1 — 2s)^)n + h((l — 2e)^) 
bits, where h(x) = —xlogx — (1 — x) log(l — x) is the binary entropy function. 
Thus, the mutual information between the result of Bob’s measurement and 
(xi, . . . , x„) is at least (1 — 2e)^n — /i((l — 2e)^) > (1 — 2e')^n — 1 bits. By Theo- 
rem 1, the communication from Alice to Bob is at least i(l — 2e)^n — ^ qubits, 
which establishes Eq. (0. 



4 Lower Bounds for Bit Protocols 

In this section, we consider exact and bounded-error bit protocols for IP, and 
prove Eqs. m and 0. 

Recall that any m-qubit protocol can be simulated by a 2m-bit protocol using 
teleportation P) (employing EPR pairs of entanglement). Also, if the commu- 
nication pattern in an m-bit protocol is such that an even number of bits is 
always sent during each party’s turn then it can be simulated by an m/2-qubit 
protocol by superdense coding ^ (which also employs EPR pairs). However, this 
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latter simulation technique cannot, in general, be applied directly, especially for 
protocols where the parties take turns sending single bits. 

We can nevertheless obtain a slightly weaker simulation of bit protocols by 
qubit protocols for IP that is sufficient for our purposes. The result is that, given 
any m-bit protocol for (that is, IP instances of size n), one can construct an 
m-qubit protocol for IP 2 n - This is accomplished by interleaving two executions of 
the bit protocol for IPn to compute two independent instances of inner products 
of size n. We make two observations. First, by taking the sum (mod 2) of the two 
results, one obtains an inner product of size 2n. Second, due to the interleaving, 
an even number of bits is sent at each turn, so that the above superdense coding 
technique can be applied, yielding a (2m) j 2 = m-qubit protocol for IP 2 n- Now, 
Eq. (0 implies m > n, which establishes the lower bound of Eq. 0 (and the 
upper bound is trivial). 

If the same technique is applied to any m-bit protocol computing with 
probability 1 — e, one obtains an m-qubit protocol that computes IP 2 n with 
probability (1 — + e^ = l — 2e(l — e). Applying Eq. (jSI) here, with 2n replacing 

n and 2e(l — e) replacing e, yields m > (1 — 2£r)'^n— For e > = 0.146 . . ., 

a better bound is obtained by simply noting that C* > Ql (since qubits can 
always be used in place of bits) , and applying Eq. (0) . This establishes Eq. (0 . 

5 An Instance Where Prior Entanglement Is Beneficial 

Here we will show that in spite of the preceding results, it is still possible that a 
protocol which uses prior entanglement outperforms all possible classical proto- 
cols. This improvement is done in the probabilistic sense where we look at the 
number of communication bits required to reach a certain reliability threshold 
for the IP function. This is done in the following setting. 

Both Alice and Bob have a 2 bit vector 3 : 1 X 2 and yiy 2 , for which they want 
to calculate the inner product modulo 2: 

/(x, 2 /) = xi 2 /i -I- X 2 J /2 mod 2 (22) 

with a correctness-probability of at least |. It will be shown that with entangle- 
ment Alice and Bob can reach this ratio with 2 bits of communication, whereas 
without entanglement 3 bits are necessary to obtain this success-ratio. 



5.1 A Two-Bit Protocol with Prior Entanglement 

Initially Alice and Bob share a joint random coin and an EPR-like pair of qubits 
Qa and Qb- 



state(Q^gs) = ^(|00) + |11)) (23) 



With these attributes the protocol goes as follows. 
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First Alice and Bob determine by a joint random coin flipQwho is going to be 
the ‘sender’ and the ‘receiver’ in the protocol. (We continue the description of the 
protocol by assuming that Alice is the sender and that Bob is the receiver.) After 
this, Alice (the sender) applies the rotation on her part of the entangled 

pair and measures this qubit Qa in the standard basis. The result rriA of this 
measurement is then sent to Bob (the receiver) who continues the protocol. 

If Bob has the input string ‘00’, he knows with certainty that the outcome 
of the function f{x,y) is zero and hence he concludes the protocol by sending 
the bit 0 to Alice. Otherwise, Bob performs the rotation By^y^ on his part of the 
entangled pair Qb and measure it in the standard basis yielding the value ttib- 
Now Bob finishes the protocol by sending to Alice the bit niA + tub mod 2. 

Using the rotations shown below and bearing in mind the randomization pro- 
cess in the beginning of the protocol with the joint coin flip, this will be a protocol 
that uses only 2 bits of classical communication and that gives the correct value 
of f{x, y) with a probability of at least | for every possible combination of X\X 2 
and yiy 2 - 

The unitary transformations used by the sender in the protocol are: 




whereas the receiver uses one of the three rotations: 




The matrices were found by using an optimization program that suggested cer- 
tain numerical values. A closer examination of these values revealed the above 
analytical expressions. 



^ Because a joint random coin flip can be simulated with an EPR-pair, we can also 
assume that Alice and Bob start the protocol with two shared EPR-pairs and no 
random coins. 
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5.2 No Two-Bit Classical Probabilistic Protocol Exists 



Take the probability distribution tt on the input strings x and y, defined by: 



, s f 0 iff a: = 00 or 2 / = 00 

nx, y) - I 1 iff 2 - ^ 00 and 2 / 00 



(26) 



It is easily verified that for this distribution, every deterministic protocol with 
only two bits of communication will have a correctness ratio of at most | . Using 
Theorem 3.20 of this shows that every possible randomized protocol with 
the same amount of communication will have a success ratio of at most (It 
can also be shown that this ^ bound is tight but we will omit that proof here.) 
This implies that in order to reach the requested ration of at least three bits of 
communication are required if we are not allowed to use any prior entanglement. 



5.3 Two Qubits Suffice Without Prior Entanglement 

A similar result also holds for qubit protocols without prior entanglement |T7j . 
This can be seen by the fact that after Alice applied the rotation and 

measured her qubit Qa with the result tua = 0, she knows the state of Bob’s 
qubit Qb exactly. It is therefore also possible to envision a protocol where the 
parties assume the measurement outcome ruA = 0 (this can be done without 
loss of generality), and for which Alice simply sends this qubit Qs to Bob, after 
which Bob finishes the protocol in the same way as prescribed by the ‘prior 
entanglement’-protocol. The protocol has thus become as follows. 

First Alice and Bob decide by a random joint coin flip who is going to be 
the sender and the receiver in protocol. (Again we assume here that Alice is the 
sender.) Next, Alice (the sender) sends a qubit \Qxix 2 ) (according to the input 
string x\X 2 of Alice and the table 1271) to the receiver Bob who continues the 
protocol. 

IQoo) = Y^|0) - IQoi) = Y^|0) + (\/^ + 

(27) 

IQio) = + (“V^ + |i) l*3ii) = 

If Bob has the input string 2 / 12/2 = 00, he concludes the protocol by sending a zero 
bit to Alice. In the other case. Bob applies the rotation By^y^ to the received 
qubit, measures the qubit in the standard basis, and sends this measurement 
outcome to Alice as the answer of the protocol. By doing so, the same correctness- 
probability of I is reached for the IP function with two qubits of communication, 
whereas the classical setting requires 3 bits of communication as shown above. 
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Appendix: Capacity Results for Communication Using 
Qubits 



In this appendix, we present results about the quantum resources required to 
transmit n classical bits between two parties when two-way communication is 
available. These results are used in the main text in the proof of the lower bound 
on the communication complexity of the inner product function, and may also 
be of independent interest. 

Theorem 2: Suppose that Alice possesses n bits of information, and wants to 
convey this information to Bob. Suppose that Alice and Bob possess no prior 
entanglement but qubit communication in either direction is allowed. Let uab 
be the number of qubits Alice sends to Bob, and ns a the number of qubits Bob 
sends to Alice (uab d-nd uba are natural numbers). Then, Bob can acquire the 
n bits if and only if the following inequalities are satisfied: 

nAB > \n/2\ (28) 

nAB + nBA > n. (29) 

More generally, Bob can acquire m bits of mutual information with respect to 
Alice’s n bits if and only if the above equations hold with m substituted for n. 

Note that Theorem 1 follows from Theorem 2 because, if the communication 
from Bob to Alice is not counted then this can be used to set up an arbitrary 
entanglement at no cost. 

Graphically, the capacity region for the above communication problem is 
shown in Fig. 0 Note the difference with the classical result for communication 
with bits, where the capacity region is given by the equation uab > a; that is, 
classically, communication from Bob to Alice does not help. 




Fig. 1. Capacity region to send n bits from Alice to Bob. uab is the number of 
qubits Alice sends to Bob, and tiba is the number of qubits Bob sends to Alice. 
The dashed line indicates the bottom of the classical capacity region. 
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Proof of Theorem 2: The sufficiency of Eqns. and follows from the 
superdense coding technique 0. The nontrivial case is where uab < n. Bob 
prepares n — uab < riBA EPR pairs and sends one qubit of each pair to Alice, 
who can use them in conjunction with sending n — uab < it-ab qubits to Bob to 
transmit 2(n — bits to Bob. Alice uses her remaining allotment of 2nAB — n 
qubits to transmit the remaining 2tiab — n bits in the obvious way. 

The proof that Eqns. II'/! Ml and are necessary follows from an applica- 
tion of Holevo’s Theorem HH, which we now review. Suppose that a classical 
information source produces a random variable X . Depending on the value, x, 
of X, a quantum state with density operator is prepared. Suppose that a 
measurement is made on this quantum state in an effort to determine the value 
of X. This measurement results in an outcome Y . Holevo’s theorem states that 
the mutual information I{X : Y) between X and Y is bounded by the Holevo 



where Px are the probabilities associated with the different values of X, p = 
'^xPxPx, and S is the von Neumann entropy function. The quantity on the right 
hand side of the Holevo bound is known as the Holevo chi quantity, x{Px ) = 



Let X be Alice’s n bits of information, which is uniformly distributed over 
{0, 1}". Without loss of generality, it can be assumed that the protocol between 
Alice and Bob is of the following form. For any value (xi, . . . ,Xn) of X, Alice 
begins with a set of qubits in state |xi, . . . , x„)|0, . . . , 0) and Bob begins with a 
set of qubits in state |0, . . . , 0). The protocol first consists of a sequence of steps, 
where at each step one of the following processes takes place. 

1. Alice performs a unitary operation on the qubits in her possession. 

2. Bob performs a unitary operation on the qubits in his possession. 

3. Alice sends a qubit to Bob. 

4. Bob sends a qubit to Alice. 

After these steps, Bob performs a measurement on the qubits in his possession, 
which has outcome Y. (Note that one might imagine that the initial states could 
be mixed and that measurements could be performed in addition to unitary 
operations; however, these processes can be simulated using standard techniques 
involving ancilla qubits.) 

Let pf be the density operator of the set of qubits that are in Bob’s possession 
after i steps have been executed. Due to Holevo’s Theorem, it suffices to upper 
bound the final value of x{pf) — which is also bounded above by S{pi). We 
consider the evolution of x(pf) and S{pi). Initially, x{Po) = *5'(po) = 0, since 
Bob begins in a state independent of X. Now, consider how x(pf') and S{pi) 
change for each of the four processes above. 



bound El 




(30) 



X 



s{p) -T.xPxS{px)- 
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1. This does not affect pf and hence has no effect on x{pf) or S{pi). 

2. It is easy to verify that x ^nd S are invariant under unitary transformations, 
so this does not affect x{pf) and S{pi) either. 

3. Let B denote Bob’s qubits after i steps and Q denote the qubit that Alice 
sends to Bob at the i + step. By the subadditivity inequality and the fact 
that, for a single qubit Q, S{Q) < 1, S{BQ) < S{B) + S{Q) < S{B) + 1. 
Also, by the Araki-Lieb inequality P, S{BQ) > S{B) — S{Q) > s\b) — 1. 
It follows that 5'(pi+i) < S{pi) + 1 and 

X{pf+i) = S{p^+i) - 

rcG{0,l}" 

<(5(p.) + l)- E 

xG{0,l}" 

= x(pf) + 2. (31) 

4. In this case, is pf with one qubit traced out. It is known that tracing out 
a subsystem of any quantum system does not increase x Cni, so x{phi) ^ 
x{pf)- Note also that 5'(pi+i) < 5'(pi) + l for this process, by the Araki-Lieb 
inequality p. 

Now, since x{pf) only increase when Alice sends a qubit to Bob and by 
at most 2, Eq. follows. Also, since S{pi) can only increase when one party 
sends a qubit to the other and by at most 1, Eq. (EU follows. This completes 
the proof of Theorem 2. 



Quantum Recurrent Networks for 
Simulating Stochastic Processes 



Michail Zal|]& Colin P. William|| 

Ultracomputing Group, Mail Stop 126-347, Jet Propulsion Laboratory, 
California Institute of Technology, Pasadena, CA 91109-8099 
colin@solstice.jpl.nasa.gov, zak@solstice.jpl.nasa.gov 



Abstract. We introduce the concept of quantum recurrent networks hy incor- 
porating classical feedback loops into conventional quantum networks. We 
show that the dynamical evolution of such networks, which interleave quantum 
evolution with measurement and reset operations, exhibit novel dynamical 
properties finding application in pattern recognition, optimization and simula- 
tion. Moreover, decoherence in quantum recurrent networks is less problematic 
than in conventional quantum network architectures due to the modest phase 
coherence times needed for network operation. 



Introduction 

Large scale classical simulations of stochastic processes require vast quantities of 
random numbers. However, since the pioneering work of Church, Turing, Post and 
Godel, it has been known that classical computers can only compute functions. In 
other words, the class of tasks that can be accomplished with a classical computer is 
exactly equivalent to the class of computable functions. However, as there is no 
function for computing a true random number, classical computers can only feign 
randomness. The purported calls to the “random number generator” often seen in 
modern programming languages are, in reality, calls to a pseudo- random number 
generator. A pseudo-random number generator is a deterministic function whose 
successive outputs pass many of the statistical tests of randomness. Unfortunately, 
the sequence of outputs can also harbor subtle correlations that are not immediately 
apparent from the common statistical measures of randomness. 

To illustrate this point vividly, consider the RANDU “linear congruential gen- 
erator”, a notoriously bad pseudo-random number generator that was common on 
IBM mainframes of the 1960s. A linear congruential generator is defined by: 

A^t+i = +tn) mod n 
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where I, m, n are fixed integers and A: = 1, 2, 3, . . . The resulting sequence of num- 
bers, appears, superficially, to generate a set of random samples from a uniform dis- 
tribution that lie in the range 0 to n-\ inclusive. We say “superficially” in the sense 
that, the sequence of numbers Ni, N 2 , ...passes many statistical tests of randomness. 
However, there is a subtle correlation lurking amongst these numbers that becomes 
apparent if you use them to choose a set of (supposedly) random points in a high 
dimensional space. In particular, if, as in RANDU, £ = 65539, n = 2^\ m = 0 and 
Aq = 1 then successive triples produced by the generator, Aj. , Aj.^, , Aj .+2 > be 
taken to define the x-, y- and z-coordinates of a point in a 3-dimensional space. These 
points are plotted in fig.l below from different viewing angles. From most viewing 
angles the points appear to be randomly distributed. But from a particular viewing 
angle you can see that they are not at all randomly distributed. In fact they lie in a set 
of parallel planes. Thus the sequence of supposedly “random” numbers output from 
the linear congruential generator are not random at all and could give misleading 
results if used in a numerical simulation of a stochastic process. 




Fig. 1 The same cube of points seen from four different viewpoints 

Pseudo-random number generators are better today than they were in 1960. 
But one should not be complacent. As recently as 1992 bugs in a supposedly “good” 
pseudo-random number generator were discovered when a numerical simulation of an 
Ising spin system was performed as a test of the simulator against a known bench- 
mark for which analytical results were known [Ferrenberg et al. 92]. 
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One way to correct the problems inherent with pseudo-random number gen- 
erators is to build a generator that exploits a truly random process itself. Strictly 
speaking, there is no such thing as randomness in classical physics. Nevertheless, 
certain classical dynamical systems exhibit a high degree of instability that would 
seem to make them reasonable candidates for effectively random number generators. 
Unfortunately, yet again one must be careful. Recently a study of the use of the lo- 
gistic map, one of the simplest generators of deterministic chaos, has revealed that the 
chaotic sequence of numbers output does not possess exactly the same statistical 
properties as a truly random sequence [Phatak & Rao 95]. The deviation might be 
quite slight, but in situations where billions upon billions of random numbers are 
needed it could lead a Monte Carlo simulation astray. 

The moral is that you cannot use classical physics to generate truly random 
numbers. However, in quantum physics, the non-deterministic outcome of a meas- 
urement made on a system in a superposed quantum state is, as a matter of principle, 
random. Unfortunately, quantum non-determinism is generally regarded as being of 
lesser importance than other quantum phenomena such as quantum interference and 
entanglement. This is partly because many people believe, mistakenly, that pseudo- 
random number generators are “good enough” and partly because the impressive 
speedups exhibited by quantum algorithms for factoring composite integers and for 
finding an item in an unsorted database, are due to interference and entanglement 
effects rather than non-determinism. However, such a dismissal of quantum non- 
determinism is premature. No matter how good new pseudo-random number gen- 
erators are purported to be, their adequacy can only be assessed empirically within the 
context of a specific application. Moreover, the key quantum effect on which quan- 
tum cryptography depends is quantum non-determinism. We argue that as quantum 
non-determinism is, intrinsically, a random process, it provides a much better basis 
for the design of a random number generator. 

It is easy to define a quantum procedure for selecting random integers in the 
range 0 to 2"-l by preparing n qubits in the state |0)|0)|0)---|0) , applying the 

Walsh-Hadamard transform to each qubit separately, creating an equal superposition 
of all possible states of the register, and then reading the memory register. Once you 
have a mechanism for generating uniformly distributed random numbers you can 
create a generator for any other distribution using a function transformation[Tuckwell 
95]. 

However true random number generation is not the same as true stochastic 
process generation. For example, in a Markov process the probability of obtaining a 
particular outcome for the next state depends upon the identity of the last state visited. 
By contrast the sequence of outcomes from a quantum random number generator are 
independent, identically distributed random variables. It is therefore interesting to 
ask whether there is a more direct way of using quantum mechanics to simulate sto- 
chastic processes? 




78 



M. Zak and C.P. Williams 



Quantum Recurrent Networks 

We can begin be asking what general features must such a simulator possess? First, 
we need to be able to generate a sequence of classically observable samples. This 
suggests that we are going to have to imagine a quantum device that allows repeated 
measurements. Second, we need to he able bias the probability that a given state will 
appear as the next output given knowledge of some or all of the previous outputs. 
This is because, by definition, stochastic processes possess such a memory effect. 
The simplest way to accommodate such a memory effect is to imagine that the device 
is reset in a new state that somehow takes account of the states visited so far. These 
considerations lead naturally to our notion of a “quantum recurrent network”. 

A quantum recurrent network consists of a conventional quantum network 
augmented with a classical measurement and quantum reset operation. The design of 
a one dimensional quantum recurrent network is shown in Fig. 2. 



r(o)) 




Fig. 2. A One dimensional quantum recurrent network 

An initial state, , is fed into the network, transformed under the action of a 
unitary operator, U , subjected to a measurement, indicated by the measurement 
operator M{ } , and the result of the measurement is used to control the new state fed 

back into the network at the next iteration. One is free to record, duplicate or even 
monitor the sequence of measurement outcomes, as they are all merely bits and hence 
constitute classical information. Moreover, one is free to choose the function used 
during the reset phase, including the possibility of adding no offset state whatsoever. 
Such flexibility makes the QRN architecture remarkably versatile. To simulate a 
Markov process, it is sufficient to return just the last output state to the next input at 
each iteration. 



Quantum Reset Operation 

The reset operation can be accomplished using conditional quantum logic. The basic 
strategy is to condition the operation performed on the offset state 1 upon the last 

measured outcome, . For example, | describes the state of 3 qubits, then the 
reset circuit will have the form shown in Fig. 3. 
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Fig. 3. Most general quantum reset circuit. 



Here, a white dot represents the binary value 0 and a black dot the binary value 1. 
Mathematically, such a reset circuit is described by a 64 x 64 block diagonal unitary 
matrix in which each block, 5, , is an 8x8 unitary matrix. 
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Thus if the last measured output were, say, = |01l) then the state re-entering the 

circuit at the next iteration will be S'j | . An unfortunate drawback of the reset 

scheme is that the circuit contains an exponential number of gates. Later, we shall see 
that it is possible to invent alternative reset strategies that do not require exponentially 
sized circuits. 



Network Dynamics 

By design, we imbue the quantum recurrent network with a discrete time evolution 
according to the equation; 
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where | is the input to the network at time t, t/ is a unitary operator defined by 
U = exp{iH At Ih), and M is a measurement operator (in the computational basis) 
that has the effect of projecting the evolved state f/|t//(t)^ into one of the eigenvec- 
tors of M . The curly brackets are intended to emphasize that M is to be taken as a 
measurement operation not a matrix product. In the simplest case, the combination 
function / , can be merely addition followed by renormalization, giving the special- 
ized update rule: 

\y/{t + At))= M|c/|(/z(t))}-r|(/z(0)) 

where the notation | • | represents renormalization. As each iteration involves a 

measurement and reset operation, decoherence, the phenomenon that bedevils most 
hypothetical quantum computations, can be largely ignored, as the quantum recurrent 
networks need only operate coherently in between successive measurement opera- 
tions; an interval of duration At. 

In general, if one were to record the sequence measurement outcomes, it would not 
settle down to a predictable pattern. Instead, the sequence would hop about erratically 
between a finite, but possibly exponentially large, number of states, executing a true 
stochastic process. If the initial, i.e., offset, state vector is 



f «o(0)^ 




and M is a measurement operator in the computational basis, then | y/{t + At)'j , the 
recurrent state re-entering the circuit, must consist of the sum of the offset state, 
I plus quantum state (constructed afresh), | , that corresponds to the last 

measured outcome, i . Hence, the recurrent state takes one of the forms: 

i+«o(or 
0 ,( 0 ) _ 1 
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with re-normalization factors: 

Rq = |l-f -f|r/ 1(0^1 -f . . . 
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Thus, the recurrent (quantum) states entering the circuit and the measured (classical) 
outcomes follow the same Markov process. The transition probability matrix, T ^ , for 

this process is given by examining how each of the recurrent states, 



evolve under the action of U: 
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where 

N-l N-l 

bT j, a, ( 0 ). 

1=0 e=o 

Ty specifies, therefore, a classical transition probability matrix between a set of 



quantum states (the recurrent states) or equivalently between a set of classical states 
(the measurement outcomes). 



Stochastic Attractors 

The process defined by the transition probability matrix Tj generates a truly random 
sequence of eigenstates. Although, for a given choice of U and |i/^(o)) every realiza- 
tion of the process will, in general, yield a different sequence of states, the statistical 
properties of these sequences, such as the frequencies of the various states visited, 
will eventually converge to a fixed distribution. This fixed distribution, is called a 
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stochastic attractor, and may be calculated as the fixed point of T^ acting on any re- 
entrant state \(l>i) ■ 

Thus the quantum recurrent network provides a mechanism for generating true 
stochastic attractors. Our model uses neither pseudo-random number generators nor 
classical white noise. The time taken to converge to this attractor is governed by the 
size of the largest eigenvalue of the transition matrix T^ . 

To be useful for Monte Carlo simulation, one would like to be able to tailor 
the quantum recurrent network so that it generates stochastic attractors that have 
prescribed characteristics. This can be accomplished by a process called “learning”. 
“Learning” consists of adapting model parameters until the quantum recurrent net- 
work produces the desired stochastic attractor to within an acceptable tolerance. This 
can be accomplished by varying the initial state fed into the network, |v^(o)) , the 

duration of the coherent evolution phase, At, or by selecting a different unitary ma- 
trix, U. From a practical perspective, as U will be embodied in physical hardware, it 
will be easier to perform learning by varying just |i/t(o)) and/or At. Varying At 

does, of course, change the unitary transformation applied during the coherent evolu- 
tion phase, so it achieves a similar effect to picking a different U. 

An arbitrary NxN dimensional unitary matrix has exactly free parame- 
ters. Therefore, in principle, a one dimensional QRN can generate a stochastic attrac- 
tor having up to degrees of freedom. The method used to find a unitary matrix U 
that will generate a stochastic process with the desired properties, could be based on 
analytic minimization, gradient descent or genetic algorithms [Hertz, et al. 91]. 



Simulating an Eight State Markovian Process 



Suppose we want to generate an 8 state Markov process that would generate the sto- 
chastic attractor shown in Fig. 4. 
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Fig. 4. Stochastic attractor defined on 8 states. 
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For this attractor, the probabilities of seeing states 0 through 7 are given by 0.119, 
0.085, 0.108, 0.055, 0.341, 0.070, 0.156, and 0.066 respectively. As the attractor is 
one dimensional it can be obtained using a one dimensional QRN acting on 3 qubits. 
To design such a QRN, we begin with a random unitary matrix, U 
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-.405+.251/ 


.167-.16/ 


-.308+.203/ 


.276-.184/ 


336+.161/ 


-.141-.236/ 


-.189-333/ 


-.147-.117/ 


.149+.297/ 


-365+.353Z 


-.229-.15/ 


.19+.042/ 


-.197+.192/ 


-.08-3351' 


-.253+.498Z 


.065+.278/ 


-.178-.103/ 


-.15+.164/ 


337-.187/ 


.239-.083/ 


-.274-33/ 


.221-338/ 


.205+.041/ 


-.101+.625/ 


.177-.099/ 


-.123+.003Z 


.02-.032/ 


-.417+.111/ 


.024+383/ 


.18+.105/ 


.172+.342/ 


-.136-.222/ 


.019+.417/ 


-.065+.345/ 


314-.189/ 


-.103+339/ 


.429-.007/ 



Given U , we can “train” the QRN to generate the desired stochastic attractor simply 
by varying the offset state, 1 1/^(0)^ . The training is complete when the difference 
between the QRN’s stochastic attractor and the target stochastic attractor is less than 
some threshold. For the given U and the given target attractor, a satisfactory offset 
state vector is given by: 

|(ir(0)) = (-.093-.368i)|000)-(.389+.31h') |00l)-(.029-.114i) |010) + (.109-.221i) |01l) + 
(.325-.347i) |l00)-(.290p.l62i) 1 10l)-(.330-.165i) |ll0)+.244 |lll) 



1000 iterations of the QRN having this U and | induces the stochastic attractor 
shown in Fig. 5. 
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Fig. 5. The attractor generated by the “trained” QRN. 



As you can see the agreement between the target attractor and the QRN’s attractor is 
very good. Moreover, one also learns a plausible transition probability matrix for the 
Markov chain that induces the target attractor. 
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.243 


.009 


.236 


.088 


.121 


.045 


.201 


.057 


.173 


.003 


.090 


.069 


.151 


.343 


.048 


.123 


.147 


.025 


.319 


.061 


.168 


.066 


.155 


.059 


.026 


.033 


.243 


.187 


.371 


.022 


.066 


.052 


.129 


.152 


.028 


.001 


.537 


.036 


.095 


.022 


.108 


.036 


.010 


.203 


.244 


.069 


.118 


.021 


.016 


.139 


.030 


.021 


.319 


.028 


.409 


.038 


.059 


.052 


.148 


.063 


.378 


.082 


.048 


.170 



This transition probability matrix is not unique because, if the attractor contains N 
degrees of freedom, the corresponding transition probability matrix contains - N 
degrees of freedom. Thus the transition probability matrix is not as constrained as the 
attractor itself. 



The A:-Parallel Case 

Next we generalize the concept of a quantum recurrent network to the case in which 
there are k networks working in parallel (see Fig. 6). 




Fig. 6. The k-parallel quantum recurrent network 

During the quantum evolution and measurement phases each network acts in- 
dependently of the rest. However, during the reset operation the results of all the 
measurements are combined with the initial state to yield k identical input states. 
Note that the reset operation does not require an arbitrary quantum state to be cloned 
(which is impossible). Instead it only requires that k classical states, the outcomes of 
the k independent measurements, be copied. As this information is purely classical it 
can be copied freely. Moreover, the initial state |v^(o)^ is known and can be gener- 
ated afresh as needed by each of the k networks. As a result, the feedback process we 
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propose is guaranteed to be physically realizable. The purpose of moving to the k- 
parallel quantum recurrent network is to permit us to generate multi-dimensional 
stochastic attractors. The reset operation gives us the flexibility to introduce correla- 
tions between the attractors in each dimension. 

For the k-parallel QRN, the elements of the transition probability matrix now 
define the probability of making transitions between sets of measurement outcomes. 
The state entering each of the k networks at each iteration will have a form such as: 





Uq 


1 \ 1 


(¥2-4) 


\rh \ — 


1 


r(V2..4)/ /„ 

V (‘1‘2-4) 


(/1I2..4) 

r^-1 ; 



where the sequence i^i^...i^ specifies the last ordered set of measurement outcomes 
obtained from the k networks and R. -^ is the renormalization constant given by: 



R 






\a 



(l\l2-.Jk) 

0 



|2 






-h... 



a 



N-l 



|2 



The mathematical form of the amplitude depends upon how many of the 

components in the k-parallel network produced the same measurement outcome at the 
last iteration through the QRN i.e. how many of the i^ , in the sequence ij/j ...ij, were 



the same. If outcome is obtained n-^ times we have = n-^ -i- a^(o). 

As there are k networks and each network can produce one of N outcomes (independ- 
ently), the k-parallel transition matrix defines a mapping from distinct sets of 
input states to A* sets of output states. If we denote the probability of transitioning 
from the set of inputs to the set of outputs jj 2 ---Jk by we have: 

_ h 

where 

,v=l ■ £=0 

Thus the ^-parallel transition probability matrix has a tensor structure of the form 
where the sequences i^i^-.-h and are defined with 





^2 ly(hh---k) 








J2 




Jk 








sJ^(V2--‘k) 



respect to some consistent ordering. 

For the k-parallel architecture, there are k free parameters. Thus we ought to be 



able to generate k-dimensional stochastic attractors having up to kN^ degrees of 
freedom. 

An interesting corollary of the QRN dynamics concerns the dynamical be- 
havior of two k=l QRNs in comparison to a single k=2 QRN that combines them 
both. For simplicity we can set the initial (offset) state vector | y/{o)) to be zero. Con- 
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sidered separately, the resulting stochastic processes have transition probabilities 
p*'*' and given by: 





2 

„(i2> _ 


J2‘2 




’ Pjl - 





By contrast the transition probability matrix of the joint QRN has components: 



Jl h 



r(2) 



-\-U 

^I'l h’’! 






r(2) 



I/'.'' +u 

Jlh 72*2 






Clearly ^ X p*^* in general. Thus the two dimensional stochastic attractor 

generated by a 2-parallel quantum recurrent network is not simply the product of two 
1 -dimensional stochastic attractors. 



Alternative Reset Strategies 



So far, we have described the most general quantum recurrent network, i.e., one 
which involves an arbitrary offset state and an arbitrary unitary operator. Unfortu- 
nately, for such QRNs, the reset operation requires a circuit, like that depicted in Fig. 
3, which is exponential in the number of qubits. This is because the required reset is 
different depending on which of the 2" possible outcomes was obtained at the last 
measurement. Moreover, to implement an arbitrary unitary matrix as a quantum 
circuit could, in the worst case, require an exponential number of quantum gates[Knill 
95]. Both these shortcomings can be sidestepped, however, by a slight modification 
to the QRN. 

Instead of allowing any multi-particle state vector to serve as the offset vector 
we could allow only product states. This would enable the reset operation to be 
achieved in only a polynomial number of operations. For example, suppose the offset 
vector for a 2-qubit QRN is the product state then, instead of the reset 



operation with exponential cost, i.e., if a and b are the most recently measured clas- 
sical outcomes, I | !^)il^)2 “''l'^)il^)2 11’ could impose a qubit-by- 



qubit reset operation | (//•^ J 



(I f^)l+l«)l)(k)2+l^)2) 



Although the latter 



operation is conditional too, the conditioning is with respect to each individual qubit 
rather than the state of the entire multi-qubit register. Thus, the polynomial cost reset 
circuit would have the form shown in Fig. 7. 
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Fig. 7 A polynomially sized reset circuit. 



Alternatively, we can imagine another reset strategy inspired by Brassard et 
al’s discovery of a notion of the “closest” product state to an arbitrary entangled 
state[Brassard & Mor 98]. As we know the ideal entangled state that we should like 
to feed back into the QRN, we could instead use the closest product state to it. This 
would perhaps be the best compromise reset, as it would approximate the generic 
QRN and yet still only require a polynomial cost reset operation. 

Likewise, rather than allowing U to be an arbitrary unitary operation, we could 
require that {/ be a unitary operation that is implementable in a polynomially sized 
quantum circuit. That is, we restrict consideration to unitary operators U that can be 
factored in the form t/ = (l7j ' (^i ••• where {/; and T,- are 

simple unitary operators describing 1 -qubit or 2-qubit gates and the total number of 
terms in the product is bounded by a polynomial in the number of qubits. As a given 
stochastic attractor can, in general, be obtained from several different transition prob- 
ability matrices, we have some degree of latitude over the exact choice of U. Thus 
restricting attention to 17’ s having a special (compact) decomposition ought not to be 
that limiting. 



Summary 

Quantum recurrent networks provide a mechanism for generating true stochastic 
attractors. By a process of learning we can tune the free parameters in a QRN to pro- 
duce stochastic attractors having prescribed characteristics, such as a specific fre- 
quency distribution for the states visited. Moreover, as the QRNs operate by inter- 
leaving quantum evolution with measurement and reset operations, they are far less 
sensitive to decoherence than other designs of quantum computers. 

Stochastic attractors find a wide range of applications in the physical and 
computational sciences. For example, one could use quantum recurrent networks as 
associative memories. Different stimuli, represented by different inputs would 

induce different stochastic attractors. The capacity of such quantum associative 
memories i.e. the number of distinct stimuli that can be recognized without unaccept- 
able error, is much higher than for a comparable classical network. 
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Abstract. We study a special inhomogeneous quantum network con- 
sisting of a ring of M pseudo-spins (here M = 4) sequentially coupled 
to one and the same central spin under the influence of given pulse se- 
quences (quantum gate operations). This architecture could be visual- 
ized as a quantum Turing machine with a cyclic “tape”. Rather than 
input-output-relations we investigate the resulting process, i.e. the cor- 
relation between one- and two-point expectation values (“correlations”) 
over various time-steps. The resulting spatiotemporal pattern exhibits 
many non-classical features including Zeno-effects, violation of temporal 
Bell-inequalities and quantum parallelism. Due to the strange web of cor- 
relations being built-up, specihc measurement outcomes for the tape may 
refer to one or several preparation histories of the head. Specihc families 
of correlation functions are more stable with respect to dissipation than 
the total wave-function. 



1 Introduction 



It has been shown that certain computational problems scale more favorably 
when carried out on a quantum system than on any classical computer (see 
e.g. P). The underlying “quantum complexity” may thus reduce computational 
complexity. Architectures for abstract quantum networks appropriate for such 
potential applications have been discussed (cf. e.g. [ 2 ]). 

On the other hand, the control of such quantum networks appears to scale 
very badly with system size: In fact, in the language of statistical physics, that 
control would amount to use “micro-states” rather than “macro-states” , a chal- 
lenging undertaking, indeed. It should therefore not come as a surprise that 
virtually all proposals up to now face severe problems when trying to go beyond 
the (coherent) control of something like TV = 10 pseudo-spins j,4f4p5|ti^ . Also the 
detailed theoretical simulation will become increasingly difficult if not eventu- 
ally impossible beyond that limit. Fortunately, as will be shown below, even 
such small networks may show a surprisingly rich behavior in terms of correla- 
tion functions. Rather than the entanglement as such this pattern of correlations 
should be considered as the basis of the expected computational efficiency as well 
as other potential applications. 



C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 89-E221 1999- 
© Springer- Verlag Berlin Heidelberg 1999 
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2 Composite Systems 

2.1 States 

The system we are going to investigate here is composed of M + 1 spins, /i = 
S', 1, 2, • • • , M. The respective states are >,p = 0,1. The corresponding 

product basis is \u{M) ■ ■ ■ r{2)q{l)p{S) >= |u • • • rqp >. Arranged in the order 
of increasing binary numbers we also introduce the single-index notation |s >, 
s = 0, 1, • • • , 2^+^ — 1 by identifying |0 >= |0 • • • 000 >, |1 >= |0 • • • 001 >, 
|2 >= |0 • • • 010 >, etc. This single-index representation will not only serve as a 
means to simplify some algebra. It reminds us that one can entirely avoid talking 
about entanglement while, nevertheless, keeping the product-space background 
still operative, though in a more subtle way: In terms of the specific operator 
combinations and their expectation values. 



2.2 Cluster- Operators 



For M -I- 1 = 5 there are (2®)^ = 1024 orthogonal basis operators. One possible 
choice would be products of local transition-operators, Ppq{p) = \p{p) >< q{p)\- 
For reasons that will become clear shortly it is more convenient to separate 
out the local unit operators l(/r) so that the remaining operators become trace- 
less. Such a scheme is provided by the Hermitian and unitary S'[/(2)-generators, 

^i(m) = Poi{p) + Ao(m) 

\2{p) = iPoiip) - iPioip) Qx 

A(m) = Ai(m) - Ao(m) 

Xo{p) = Ai(m) + Pooip) = Hp) ■ 

The corresponding product operators {j,k,l,m,n = 0, 1,2,3) 0 

Qjkin^n = A(*5)A(1)A(2)A™(3)A„(4) (2) 

with {QjkimnP = i for any {j,k,l,m,n) and 



Pi'iQjkl mnQ k'l'm'n'} — 2 Sii'S. 






mm' '~^nn 



(3) 



then come in 6 classes, depending on the number c = 0,l,---,M-|-lof sub- 
systems they act on, i.e. the number of indices unequal zero. Qooooo = 1 is 
the only c = 0 cluster operator. When transcribed to the single index-space, 
s = 0, 1, • • • , 2® — 1, these operators appear like a set of “generalized” 51/(2)- 
operators of the form given in eq. (CQ) with each single transition or projection 
operator replaced by a group of 2^ = 16. Such operator combinations would 
be hard if not impossible to implement in a simple one-particle system with 2® 
states; they reflect the structure of the underlying product space. Correspond- 
ingly, the expectation-value of any cluster-operator is a sum of 2^+^ density 



Correlation between Correlations: Process and Time in Quantum Networks 



91 



matrix elements in the single-index space. Examples for M = 2 are the c = 1- 
cluster operators, 



Qsoo — (Ai + As + As + A?) ~ (Ao + A2 + A4 + Ae) 
Qoso = (A2 + As + Ae + A?) ~ (Ao + Ai + A4 + As) 



or c = 2-cluster operators such as, 

Qsso = (Ao + As + A4 + At ) — (Ai + P22 + As + Ae) /g-. 

Qsos = (Ao + P22 + P55 + At ) — (Ai + As + A4 + Ae) • 

Any operator A in the 2 ^-dimensional Hilbert-space of spin-states can be 
represented as (summation over repeated indices). 



A = 



1 

¥ 



AjklmnQ jkln 



( 6 ) 



with the parameters 

Ajklmn — P^i^AQ . (7) 

(Tr means trace over the total Hilbert-space.) In particular, the network Hamil- 
tonian H can be specified by the model parameters Hjkimn', they are usually 
constrained to c=0, 1 and 2-cluster-terms [Zj. The density operator p is uniquely 
defined by the set of expectation-values (note that Qjkimn is unitary) 



— 1 < Kjki mn — jkl mn } < 1 ( 8 ) 

with c-cluster-operators defining c-particle correlations. For a pure state, p = 
\tp >< tp\, eq. (0 reduces to 

P-jklmn 3klmn\‘4^ ^ • ( 9 ) 



By definition, ATooooo = 1; the local Bloch-vectors ATjoooo, ATofcooo, ATooioo, etc. 
{j,k,l = 1,2,3) are equivalent to the respective reduced density matrices. A 
pure local state has Bloch- vector-length 1. For so-called product states all these 
correlations factor into one-point functions, i.e. 



Kjklmn — KjOOOO ' KokOOO ' KoOlOO ' KoQOmO ' KoOOOn 

but, in general, they are independent. Local realism (cf. 0), to be sure, postu- 
lates that an appropriate distribution of local variables (eigenvalues Xj = ±1) 
could explain all these correlation functions rendering them statistically depen- 
dent; at least for larger networks this approach is no longer tenable. On the 
other hand, as will be shown below, the quantum mechanical evolution gener- 
ates “correlations between correlations” . 

For later reference we also define symmetrized correlation functions within 
one and the same system p: 

(^AB = ^(Tr{/5A(Ai)i3(/r)}-t-Tr{pH(^)i(/r)}) . 



( 10 ) 
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Restricting ourselves to traceless operators, this correlation is independent of p 
(for two-dimensional Hilbert-spaces) and can simply be written as the normalized 
scalar product between the two representing vectors |2j; for p. = S, e.g., 



'-'AB — 



210 



Ijoooo-Bjoooo 



( 11 ) 



2.3 Unitary Transformations 

A unitary transformation of an operator A, 

A' = UAU+ (12) 

with [/+(/ = [/(/■*■ = 1, reads in terms of the SU{2)~ parameters, 



Af A 

■^jklmn ~ k I m n -^j'k'l'm'n' 



(13) 



where 

^ Tr{U+Q,UmnUQpk'l'm'n'} (14) 

(For U = 1, X is just the unit matrix, see eq. ()3I)). There are different types: We 
may distinguish transformations which operate in certain subspaces only. The 
locally selective transformation U (S') in the 2-dimensional local Hilbert-space of 
S, e.g., is equivalent to a local rotation of the S[/(2)-parameters with respect to 
the first index j, generated by (cf. H) 



Y^j'k'l'm'n A A A 

^jklmn — ^jj' Okk'Oll'Omm'Onn' 

xjf; = lTrs{U+(S)A,(S)U(S)Ay(S)} . 

(Here, Trs means trace over the subspace of S only.) As = 1, Ab/ = 0 if 
either j or j' is zero, all parameters Aokimn are invariants. Correspondingly, a 
unitary transformation U{S, 1) leaves the expectation values Aoozmrt unchanged, 
etc. These invariants (conservation laws) are important characteristics of the 
respective transformations. 



2.4 Time 

As we do not consider equations of motion explicitly, time enters at most indi- 
rectly: To specify change, order and duration. For closed systems, unitary trans- 
formations are the only allowed type of changes (of states or observables) in 
closed quantum systems. Typically they are generated by the underlying Hamil- 
ton model. In the Schrodinger-picture this unitary transformation is applied to 
p, in the Heisenberg-picture the inverse transformation (replacing U by and 
vice versa) is applied to the observables. 

Parameter time T will come in with respect to the order, in which certain 
transformations are applied, as a continuous parometer controlling the individual 
transformation quantitatively (“pulse length” t), and, eventually, with respect to 
the order of measurements. Finally, the induced dynamics can be characterized 
by correlation-and recurrence-times. 
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3 The Turing Model 

Our system is sketched in Fig 0 Spin S is the “Turing head”, the other ^ = 
= 4 subsystems denote memories (as part of a circular “Turing 
tape”); the latter do not interact directly and are separated by “empty ” cells. 
The head interacts with at most one cell at a time P33; it moves clockwise and 
step by step to one of the 2M positions on the tape; there is no need for a feed- 
back between the internal quantum state of the network and this pre-determined 
“classical” movement. 




Fig. 1. Quantum Turing machine {M = 4). The circular Turing tape consists 
of fj, = memory cells (position 2/r) separated by empty cells (position 

2/x — 1). The Turing head moves clockwise thus initiating a local (position-index 
odd) or a pair transformation, respectively (position index even) . 



We assume to have explicit control over the model parameters Hjkimn defin- 
ing the Hamiltonian, which may even be modified in terms of pulses in parameter- 
time {tj is the pulse-length): 



H{t) — Hj for Tj—i ^ t ^ Tj—i -1- tj = Tj . 



(16) 
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Granted this access we can implement virtually any unitary transformation via 

T,_i) = U, = (17) 

(j is the step number), though this may seriously be limited in practice. If the 
Turing head is over an empty cell (tape position 2/r— 1), a local transformation 
Ua{S) on S is applied, if it is in contact with a memory cell /i at position 2/x a 
pair transformation on (5", /r) is induced (/r = 1, 2, • • • , M). 



3.1 Local Transformation on {S) 



Let us consider the one-parameter-form 



|0(5') > — >cos(a/2)|0(5') > -isin (0/2)11(5) > 
jl(5) > — > — zsin (q:/2)|0(5) > -I- cos (a/2) 1 1(5) > 



which can be generated by (M = 4) 

Ua{S) = Qooooo cos (a/2) — Qioooo * sin (a/2) = C//l„(5) . (19) 



According to eqs. (Il9ll lit oil and m, we find 

Xjp = cos^ (a/ 2 ) Sjj’ + |sin^ (a/2)Tr5{AiAj AiAj/} 

( 20 ) 

-|-|sina Tr 5 {AiAjAj' — AjAiAj'} 

so that ^ = 1, ^ 2 ^^ = = cos a, Agf^ = — A 23 ^ = sin a. (Here 

/ C\ 

and in the following all terms not explicitly given are zero.) This matrix Ab 
defines a rotation of the Bloch- vector of 5 around the fc=l-axis in the 2,3-plane. 
The phase a may be taken to result from a pulse of duration t 



a = gt (21) 

where g would be the coupling strength to an external optical driving field. The 
correlation function between A = Aa(5) transformed by 4> and the same operator 
transformed by phase angle cj) + a then is, according to eq. d, 

C'33 ^ (/>j /> + a) = cos a . (22) 

Based on eq. d2U this expectation value can be interpreted as a 2-time 1-particle 
correlation function in the Heisenberg-picture. Combinations of these have been 
shown to violate temporal Bell inequalities El- 
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3.2 Pair Transformation on {S, fi) 

This unitary transformation is taken as the conditioned 7r-pulse, (g = 0,1) 



Resonance: |0(S')0(^) > < — *■ |0(S')1(^) > 
Off-resonance: \l{S)qlfi) > < — > jl(S')q(^) > 



which we may write, in terms of cluster operators, in the form 

U{S, 1) = Poo(5)Ai(l) + Ai(^)i(l) 



— 5 (Qooooo + Qsoooo + Qoiooo — Osiooo) — U^{S, 1 ) . 



(24) 



These operators U {S, fj.) commute; their implementation requires pair interac- 
tions, which make the transition frequency in subsystem /i depend on the state of 
subsystem S This transformation has become known as the (quantum-) 

controlled NOT 0, as subsystem S acts as a control for a 7r-pulse on We may 
associate a fixed pulse duration to with this implementation; here we assume 
to ~ 0. In general, the two types of unitary operators do not commute: 

[U{S,f,),Uc.{S)]=sm{a/2){i{fl)-X,{^l))X2{S) . (25) 



4 The Process 



4.1 The First Cycle 



We are now in a position to follow up the ordered sequence of 2M = 8 unitary 
transformations. 



1^(1 j ) > 



(26) 



where (/r = 1, 2, • • • , M) 

U2^ = U{S,fi) . 



(27) 



Here and in the following the upper index pair in parenthesis denotes the cycle 
number m and the step number j, respectively. With j = 2fi (fi = 1,2, ■■■ , M) 
we may associate the time (cf. eq. (12 111 1 



11 fi 

?2/x = t2i-i = cni/g « T2/X-1 ■ (28) 



T 2 M is then the time needed for each cycle. Now, let the initial state be >= 

|0 >= |00000 > so that the local Bloch-vectors are given by 



30000 ~ -^03000 



'( 1 . 0 ) 



_ ^(1.0) _ _1 
~ -1^00003 ~ 1 



(29) 
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In the first step we apply the local transformation with a phase leading to 
|^(i,i) >_ cos(ai/2) |0 > — isin(ai/2) |1 > . (30) 

In the second step we execute the pair transformation on (S,l): 

>= cos{ai/2) \2 > —is\n{ai/2) \1 > . (31) 



In the third step we again apply the local transformation, now with phase a 2 , 
leading to 



> = COS (ai/2) COS ( 02 / 2 ) |2 > —i cos (ai/2) sin ( 02 / 2 ) |3 > 

—i sin (ai/2) cos ( 02 / 2 ) |1 > — sin (ai/2) sin ( 02 / 2 ) |0 > . 



(32) 



In the “Heisenberg-picture” , this implies between step 2 and step 3 the local cor- 
relation as given by eq. (E2I with a = tt 2 . In the 4th step the pair transformation 
on (S', 2) implies 



> = COS (ai/2) COS (a2/2) |6 > —f cos (ai/2) sin ( 02 / 2 ) |3 > 

— i sin (ai/2) cos ( 02 / 2 ) |1 > — sin (ai/2) sin (a2/2) |4 > . 



(33) 



This procedure is continued with respect to the next memory cells 3 and 4 (steps 
5 through 8). We note that the single-subsystem expectation values of subsystem 
S and ^ obey the relations 



-^30000 

^( 14 ) 

-^30000 

4^10000 



= -K, 



( 1 , 2 ) 



03000 ~ -^30000 *- 0 ® ®1 

~ 4\q3qqq — -0.30000 cos U!2 



etc. 



(34) 



= K. 



(l,2/.«) 

20000 



= 0 



and as a consequence of the controlled-NOT-logic (cf. eq. (0), 



^( 1 , 2 ) 

41-33000 ~ 4 \ 30300 



= k: 



( 1 . 6 ) 

30030 



_ ^(1,8) _ 

~ 30003 — i 



(35) 



We thus see that the two systems, S and are strictly anti-correlated after 
step 2/r (the state |^d,2) g jg actually an eigenstate of QsooooO? while the 
local Bloch- vector-lengths are less than 1, i.e. local properties are not dispersion- 
free (“fuzzy”). This is typical for non-classical correlations. There can be strict 
correlations between fuzzy subsystems. 



4.2 Cycles m > 1 

We can summarize and generalize the above results by introducing the following 
functions: 



K('"dAi)(Q,^^ ...aj) = 

-[cos (moi) cos ( 77102 ) ■ • • cos (moj)] -I- - 



1 m even 

cos oi cos 02 • • • cos Oj m odd. 
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^(m, 2 M) above with cosmai replaced by sinmai, cosai replaced by — sinai 
and the 1 replaced by 0 {j < M), 

= _ cos (mai/2) cos (ma 2 / 2 ) ■ • • cos (maM/2) m even, 

^{ m , 2M ) ^ cos ((m + 1 )q!2/2) • • • cos ((m + l)o!fe/2)) 

X cos ((m — l)afe+i/2) • • • cos ((m — l)aM/2) m odd, 



and with (m + 1) replaced by (m — 1) and vice versa. 

Then, at the end of each cycle m, the Turing head can be described by 
(M = 4) 

-^10000 ~ 

^20000 = «:i™’®^(ai,a2,a3,Q;4) (36) 

^30000 = -K^’"’®Hai:a2,a3,a4) 

and the memory cells by 



-^03000 

-^00300 

-^03300 

K. 






= K-h 






00330 



etc. 
’® H « 3 ) 



^ 0^033 = 



'■00033 

■^03030 

j^{m,8) 

■''■00303 

j^{m,8) 

■''03003 









Q!2 



0 : 3 ) 

04 ) 

0:3, 0:4) . 



(37) 



The memory pair-correlations are all positive for m even and decay with “step 
distance”, i.e. the number of intermediate rotation and coupling steps to other 
memory cells (cf. also Fig 14.211 . The pair correlations between Turing head and 
the memories are given by 



T^(m,8) _ (m.8) 

■''33000 ~ A1 
^(m.8) _ (m.8) 

■''30300 “ A2 



etc. 



(38) 



All the expectation values are strictly periodic in m if aj = for all 

j = 1,2, ■ ■ ■ , M with pj a whole number. The period p is then the smallest even 
number that has all these pj as factors. 

In a similar way one obtains the results for step numbers smaller than 2M = 
8. Generalizations to the situation where the phase angles differ from cycle to 
cycle are also straight-forward. For example, based on egs. (j,S7j) . (I.SiSIl we find a 
web of correlations like 



■''33000 ■ ■''03000 ~ ■''30300 



■''00300 ~ ■''30030 ■ ■''00030 



etc. 



(39) 



valid for all steps j within any cycle m. 
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Fig. 2. Alternative histories, a. Decision tree with respect to step number 2/r 
= 2, 4, 6, 8, as realized in an ensemble of non-interacting spins S under the 
series of local transformations C/q,^ (5), but with immediate actual measurements 
(replacing U{S, fi) of our Turing machine) at times T 2 ^. b. For the single Turing 
machine all the possible histories are yet undecided and associated with the 
states of the Turing tape as given. 



5 Reduced Descriptions 
5.1 Turing-Head S 

The description reduced to the subsystem S is based on the local Bloch-vector 
AlyoooO) j = 1; 2, 3 only. Starting from the ground-state, ATg^QQQ = — 1, this vector 
is subject to the rotation as given by eq. II 1 911 . We see that each controlled NOT 
operation implies a projection on the 3-axis (ATioooo = Al 2 oooo = 0). The result 
of eq. m for cycle 1 is easily generalized to M > 4 with n = 1, 2, • • • , M and 
= 7 t/M. We find 

kHoT.. = -cos^{7t/M) . (40) 



Correlation between Correlations: Process and Time in Quantum Networks 



99 



With a^, = gt 2 u-i (cf. eq. — 1 is the step number), the quantum- 

Zeno-effect results within the fixed time gT 2 M = tt (cf. eq. 

It is interesting to note that the reduced density matrix (or Bloch-vector) of 
subsystem S is, at any time t, identical with the density matrix of an ensemble of 
non-interacting spins (all with the same initial state and subject to the same local 
unitary transformation) but actually measured at each time T21,, v = 1, 2, • • • , M . 
For each ensemble member the series of measurements constitutes a “decision- 
tree”, with each measurement result given by ^^30000 = 2). The 

ensemble average over these trajectories leads back to the behavior realized here 
by just one single object! The respective density matrices are identical. This is 
what one may call quantum parallelism. The interaction with the tape generates 
a dynamical evolution of the Turing head S equivalent to 2^ different histories 
(cf. Pj), clearly an exponential gain. This will only hold, though, as long as no 
measurements are performed. 

5.2 Turing- Tape 

Contrary to the Turing head S, the other subsystems are each addressed by 
unitary transformations only once (within each cycle). Due to the built-in logic 
the state of subsystem 1 is strictly anti-correlated with S after preparation step 
2, subsystem 2 is anti-correlated with S after step 4, and so on. This means 
that an actual projective measurement performed on these subsystems would 
reveal also the respective states of S. When the transformations are interpreted 
to happen in parameter-time T2^, the subsystems g ^ S indeed act as a kind 
of “memory” . They allow delayed measurements on S. One may argue that this 
fact is the origin of the quantum-Zeno-effect discussed in Sect. (j5. It : It suffices 
to be able to measure in order to get the freezing-tendency of measurements 
(“virtual watchdog”- effect). 

Local measurements of the memory cells amounts to the application of a 
projection or transition-operator like Poi(h')- As these operators commute among 
each other (for different fi) and with any of the unitary operators not acting on 
/i, we can postpone these measurements up to one cycle. For /i = 1, e.g., 

U(SA)UaAS) ■ ■ ■ >= 

A)i(1)17(5,4);7„,(5)...17(5,1)P(5)„JV’^'"’0) > . ^ ^ 

Let us first restrict ourselves to cycle m = 1 with its decision tree iFig 14.211 . 
The time order of these measurements (i.e. the measurement process) need not 
correspond to the time-order, in which the memory cells have been visited by the 
Turing head: The actual history for the latter (out of the possibilities as shown 
in Fig may thus be “realized” even backward in time! 

But not only this: The correlation between memory cell 1 and 2, e.g., must, by 
construction (cf. eq. (15511 1 and the invariance property ATq 33 oo = A'Qgg'^Q, reflect 
the correlation between the states of S taken at T 2 and T4, respectively. This is 
readily verified by comparing our result for ATgggQQ, eq. (15 /11 . with C'33'^ given by 
eq. (then a two-time correlation function in the Heisenberg-picture) . The 
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fact that ^03300 ^ 33 ^ identical means, that a measurement of ^03300 

can be used to infer the unperturbed {T 2 ,T 4 ). This holds, correspondingly, 
for -ftioo 330 -^ 0003 ^ sense time-correlations of the past still “coexist”. 

As we continue into the cycles m > 1, the unique identification of tape state 
and head history is gradually lost: histories become undecidable. The “meaning” 
of those measurements thus strongly depends on the step- and cycle number. At 
the end of cycle m + p, to be sure, the original situation is restored. The time- 
parameters T 2 fj, labelling those histories are thus defined only modulo pTg (if 
period p exists). 



6 Special Machines 

6.1 A “Coin- Tossing Machine” 

For the machine defined by = 7r/2;/r = 1,2, 3, 4} all pair correlations and 

all one-point expectation-values are zero by the end of cycle m = 1 (cf. eqs. 
(E3, ®). The resulting histories all have the same probability and look like 
those of independent coin tossings at the times T 2 ^. As for the “Zeno-machine” 
= tt/M; p= 1, 2, • • • , M}, a complete measurement of the tape state at the 
end of cycle m = 1 would allow us to reconstruct the history of S. The period 
is p = 4. 



6.2 A “Cat Machine” 



As a next example let us consider the Turing machine defined by {ai = tt/2, 02 = 
0:3 = 04 = 0}. The period is p = 8, again independent of M: >= 

|^(m+8j) gjjgj gf gygig jYi all memory cells are strictly correlated 

(cf. eq. fTTll 'l. Furthermore, 



|^(i.8) (111110 > -i|00001 >) 

v2 



(42) 



is found to be a so-called cat-state, for which the decision tree of Fig collapses 
to two histories only, (fill) and (0000), respectively. Moreover > is a 

different one. As a process the built-up of these cat-states is thus quite simple. 
While cat states are reduced to product states by the decay (measurement) of 
any individual subsystem, all the memory pair correlations discussed here remain 
intact as long as the decaying subsystem is not part of that very pair. 



6.3 Large-Scale Predictability 

For m = 100 < p and M -|- 1 = 10 we would have roughly m2^ « 5 • 10“^ 
transformations in a 2^+^ « 1000-dimensional Hilbert-space; nevertheless, the 
calculation of these expectation values would scale, at most, linearly with M, 
independent of m! This indicates that simulations even of large networks could 
become feasible based on such rules. Of course, the number of expectation values 
increases exponentially with the system size M -|- 1. 
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7 Conclusions 

We have discussed the dynamics of a special quantum network, which combines 
quantum-mechanical and classical features: The quantum-mechanical variables 
consist of a “Turing head” (pseudospin S) and a “Turing tape” (M memory 
spins). Classical variables are the phenomenological Hamilton-parameters, which 
are switched externally to generate discrete unitary transformations. The ma- 
chine behavior is defined by its initial state and the phase angles specifying 
those transformations. 

This switching can be visualized as being induced by the Turing head per- 
forming pre-determined cycles over 2M Turing head positions. Correlations in 
terms of multi-point expectation values are built up in this process. Time defines 
the order of non-commuting operations and quantitatively controls transforma- 
tion parameters. 

The structure of these correlations may be attributed to the notorious “holis- 
tic nature” of quantum mechanics. Nevertheless, this built-up follows a strict 
logic: the type of admissible manipulations (rotations) is severely constrained 
in all but the simplest 2-level-space; this observation certainly applies to our 
present 2^+^-level-model. Additional constraints are built in by the selection 
of transformations which are actually implemented. Here they relate to the fact 
that the multi-levels actually refer to M -|- 1 subsystems. These constraints are 
reflected by the spatio-temporal pattern of correlations. 

There is probably good news and there is bad news as far as the consequences 
are concerned: The bad news is that the implementation of specific processes is 
much more constrained in the quantum regime than in the macroscopic world; 
this makes experimental progress in quantum computation depressively slow. 

The good news could be that, eventually, only constrained systems can make 
up a useful machinery; systems with large, unrestricted state spaces (like a free 
gas) are “useless”. The constraints are something like fixed axles, wheels, and 
connecting rods in classical mechanics. Under fairly moderate conditions those 
correlations and the correlation between correlations should constitute a machine 
behavior. Rather than enforcing some specific behavior defined by abstract al- 
gorithms we might be better off trying to exploit the experimental repertoire of 
real quantum networks. 
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Abstract. We discuss some seemingly paradoxical yet valid effects of 
quantum physics in information processing. Firstly, we argue that the act 
of “doing nothing” on part of an entangled quantum system is a highly 
non-trivial operation and that it is the essential ingredient underlying the 
computational speedup in the known quantum algorithms. Secondly, we 
show that the watched pot effect of quantum measurement theory gives 
the following novel computational possibility: suppose that we have a 
quantum computer with an on/off switch, programmed ready to solve a 
decision problem. Then (in certain circumstances) the mere fact that the 
computer would have given the answer if it were run, is enough for us to 
learn the answer, even though the computer is in fact not run. 



1 Introduction 

Many recent developments in quantum computation are motivated by existing 
results in theoretical computer science, adapted and rewritten in a quantum con- 
text. This includes much of the recent work on quantum error correcting codes 
(see for example jdtflhj ) and also the idea of using the Fourier transform to de- 
termine periodicity, which underlies many of the known quantum algorithms Q • 
There are relatively few results (such as 0) with no classical analogue, motivated 
intrinsically from considerations of physics. This is a curious situation consid- 
ering that the entire subject of quantum computation derives from differences 
between the classical and quantum laws of physics. Apart from the computer 
science benefits of providing more efficient computation, an important funda- 
mental aspect of the subject is the insight that it might provide for a deeper 
understanding of the quantum laws and their origins. Computer science and in- 
formation theory provide an entirely new conceptual framework for considering 
this question of physics. Thus we will consider the question: what are the essen- 
tial physical effects that give rise to the known computational speedups? And 
is it possible to use other differences between quantum and classical physics for 
novel computational possibilities? 

2 Quantum Information Processing and Entanglement 

It is often said that the power of quantum computation derives from the su- 
perposition principle - the ability to do different computations in parallel in 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. in.l- nT?l 1999. 

© Springer- Verlag Berlin Heidelberg 1999 
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superposition, and combine the results with cleverly arranged interferences. But 
this explanation is not precise enough because classical waves also exhibit su- 
perposition and any effect of superposition can be mimicked by a classical wave 
system. However there is an essential difference between classical and quantum 
superposition, which lies in the different way that the two physical theories de- 
scribe composite systems |2|. 

Consider n two-level systems. In the classical case we may for example think 
of each system as comprising the two lowest energy modes of vibration of a 
string with fixed endpoints together with all superpositions. According to the 
laws of classical mechanics, the total state space of the composite system is the 
Cartesian product of the n subsystem spaces. Thus no matter how much the 
strings interact in their physical evolution, the total state is always a product 
state of the n separate systems. Hence we can say that the information needed 
to describe the total state grows linearly with n (being n times the information 
needed to describe a single subsystem). 

In contrast, according to the laws of quantum mechanics the total space is 
the tensor product of the subsystem spaces and a general state may be written 
as 

1 

|'0n) = ^ ^ flzi...in l*l) ■ • ■ l*n) (1) 

Thus generally we will have 0(2") superposition components present and the 
information needed to describe the total state will grow exponentially with n. 
The novel quantum effect here - the passage from Cartesian to tensor product - 
is precisely the phenomenon of entanglement i.e. the ability to superpose general 
product states. 

As stated above, quantum entanglement can be readily mimicked by clas- 
sical wave systems: instead of taking n two-level systems, we consider a single 
classical wave system with 2" levels, allowing general superpositions of all these 
levels, and merely interpret these as entangled states via a chosen mathemat- 
ical isomorphism between (8>"V2 and V2« (where 14 is a fc-dimensional vector 
space). However this mathematical isomorphism is not a valid correspondence 
for considerations of complexity (i.e. in which we assess the utilisation of phys- 
ical resources): if the 2" classical levels are, say, equally spaced energy modes, 
then to produce a general state in 14" we will need to invest an amount of en- 
ergy exponential in n, whereas a general state in C>"V2 will require only a linear 
amount of energy (as at most, each of the n two-level systems will need to be 
excited). The essential point here is that entanglement allows one to construct 
exponentially large superpositions with only linear physical resources and this 
cannot be achieved with classical superposition. 

In the sense described above the state can encode an exponentially 
large amount of information. This would be of little consequence if we could 
not process the information in a suitably efficient way. Fortunately the laws 
of quantum physics allow precisely this possibility, which appears to be at the 
heart of the computational speedup exhibited by the known quantum algorithms. 
Suppose we apply a one-qubit gate U to the first qubit of the entangled state 



Quantum Effects in Algorithms 105 



\ipn)- This would count as just one step of quantum information processing but to 
compute the result classically (say by matrix multiplication) we would calculate 
the new amplitudes by 



where Uji is the unitary matrix for U . Now, this computation involves exponen- 
tially many steps: the 2x2 matrix multiplication of U needs to be performed 
successively 2”“^ times for all possible values of the indices j 2 ■ ■ -jn- Although 
the action of U on qubit 1 is a physically simple operation, it is represented 
mathematically as a tensor product U ® I 2 ® ® I 2 (where I 2 is the identity 

matrix which represents “doing nothing” on qubits 2 to n) and hence mathe- 
matically it becomes an exponentially large unitary operation. Thus because of 
the tensor product rule we can (somewhat enigmatically) state the principle: 
(PI): The physical act of doing nothing on part of an entangled composite 
system is a highly nontrivial operation. It leads to an exponential information 
processing benefit if used in conjunction with performing an operation on an- 
other (small) part of the system. 

Indeed it is difficult to process the quantum information by only a “small 
amount”. Eq. (|2I) illustrates that any small local operation (addressing a small 
part of the system) will generally correspond to an exponentially large processing 
operation from a classical point of view. Intuitively this reflects the denseness of 
the exponential quantum information stored within the linear resources. 

One may object to (PI), claiming that surely the information processing gain 
arises from the local operation that is actually performed (e.g. U above) rather 
than from the part that is not performed (e.g. the (n — 1) identity operations 
above) ! To see that this is not the case consider our row of n qubits and suppose 
now that U operates on the first k qubits (so t/ is a 2^ x 2* matrix). Let us 
compare the number of steps required to perform this transformation in the 
classical and quantum contexts respectively. It is known that any d x d unitary 
matrix may be programmed on a quantum computer in 0{d^) steps so 

the quantum implementation of U will require 0((2^)^) steps. Classically, direct 
matrix multiplication for a, d x d matrix requires O(d^) steps. For U we have 
d = 2^ and the multiplication must be performed 2"“^ times. Thus the classical 
implementation will require 0((2^)^2"“^) steps. Hence the ratio of quantum 
computing effort to classical computing effort is 0(2^/2"). This ratio decreases 
if either n is held fixed and k is decreased, or k is held fixed and n is increased. 
In either case we are increasing the proportion of “doing nothing” and this is 
giving rise to an increased information processing benefit. 

The Fourier transform is a fundamental ingredient pncBj in most of the 
known quantum algorithms which exhibit a super-classical computational speed- 
up. This includes the algorithms of Deutsch nn!> Simon HH, Shor 1 1 21 1 ;i[ and 
Grover US!. Using the mathematical formalism of the fast Fourier transform 
(FFT) P3|, the unitary transformation that is the Fourier transform can be 
implemented exponentially more efficiently in a quantum context than in 



1 
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any known classical context. For example, for the group of integers modulo 2"’ 
the classical fast Fourier transform algorithm runs in time 0(n2") whereas its 
quantum implementation runs in time O(n^). An analysis of the implementation 
of the FFT algorithm in the quantum context, given in detail in j^, shows that 
the achieved exponential speedup may be entirely attributed to the influence 
of the principle (PI). This appears to be an essential feature of the speedup 
exhibited by all known quantum algorithms. 

The full (exponentially large) amount of information embodied in the iden- 
tity of a quantum state |^n) is termed “quantum information”. The formal- 
ism of quantum mechanics places an extraordinary limitation on the above 
entanglement-related benefits of quantum information storage and processing: 
quantum measurement theory implies severe restrictions on the accessibility of 
the quantum information in the state. For example, according to Holevo’s the- 
orem 0 we can obtain at most n bits of information about the identity of an 
unknown state \tpn) of n qubits by any physical means whatever. This bound is 
the same as the information capacity of a classical system with the same number 
of levels. Thus, curiously, natural physical evolution in quantum physics corre- 
sponds to a super-fast processing of (quantum) information at a rate that cannot 
be matched by any classical means, but then, most of the processed information 
cannot be read! It is a remarkable fact that these two effects do not anull each 
other - the small amounts of information that are possible to obtain about the 
identity of the final processed state do not coincide with the particular meagre 
kinds of information processing that can be achieved by classical computation 
on the input running for a similar length of time. This disparity directly entails 
the computational speedup possibilities of quantum computation. 

3 Counterfactual Quantum Computation 

We have argued above that the information processing benefits seen in the known 
quantum algorithms all rest on some specific features of quantum entanglement. 
However these features do not exhaust all the ways in which quantum physics 
differs from classical physics. In an effort to find new quantum algorithms we 
might ask whether other non-classical features of quantum physics may be ex- 
ploited for novel computational possibilities (not necessarily just a speedup of 
computation). Quantum measurement theory (c.f. the inaccessibility of quantum 
information mentioned above) provides further non-classical aspects of the quan- 
tum formalism and these are also related to controversial interpretational issues. 
We will now describe a novel computational possibility which we call “counter- 
factual quantum computation”, based on properties of quantum measurement. 

A counterfactual effect may be defined as an observable physical effect E 
whose outcome depends on an event A that might conceivably have happened 
but in fact did not happen i.e. E is affected by the mere existence of A as a valid 
possible alternative even though A did not actually occur. Classical physics does 
not allow physically observable counterfactual effects but quantum physics does, 
at least in the sense described below. Their surprising and somewhat paradoxi- 
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cal occurrence in quantum mechanics has been highlighted in Penrose pn| (see 
especially §§ 5.2, 5.3, 5.7, 5.8, 5.9, 5.18). 

Suppose that we have a quantum computer which has been programmed 
ready to solve a decision problem. The computer also has an on/off switch, 
initially set in position off. We will show that in certain circumstances, the mere 
fact that the computer would have given the result of the computation if it were 
run, is sufficient to cause a physically measureable effect from which we can learn 
the result, even though the computer is in fact not run\ Our method is based 
on the so-called Elizur-Vaidman bomb testing problem m and the essential 
idea may be clarified by considering the operation of a simple Mach-Zender 
interferometer, which we discuss first. 

Consider the Mach-Zender interferometer as shown in the following diagram. 




Here HI and H2 are beam splitters and Ml and M2 are rigid perfect mirrors. 
The action of each beamsplitter is taken to be the following (written in terms of 
the states labelled at H2). For horizontal photons 

|[7) ^ + |G)) (3) 



\L)^^{\F)-\G)) 



and for vertical photons 



(4) 
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A photon enters at |A) and is separated into a superposition -^(1^) + |t^)) of 
upper and lower paths. In the absence of the measuring instrument A4 the two 
beams coherently interfere at H 2 and according to eqs. o and the result is 
\F). Thus the photon is always registered in detector T and never in detector Q. 

Consider now a nondestructive measurement device M. placed in the lower 
arm, which registers whether or not the photon passed along that arm. The 
initial state of M. is \Mq) and if a photon is registered it becomes an orthogonal 
state I Ml). Following the photon we now have 

|A) - ^(|C7) + \L)) |Mo) - |Mo) + \L) \M,)) (5) 

and the last state may be thought of as the “collapsed” mixture of \U) |Mq) or 
\L) I Ml), each with probability half. Thus the interference at iJ2 is spoilt and 
we always have a 50/50 probability of registering the photon in either T or Q. 

Suppose now that the photon is registered in Q and the measurement instru- 
ment is seen to be in state |Mq). (This event occurs with probability ^.) Thus 
the photon has been registered absent in the lower arm and the measurement 
instrument, having thus apparently done nothing, remains in state |Mq). Yet 
the photon is seen at Q, which is forbidden in the absence of Ad! Although Ad 
apparently does nothing, it cannot be removed, since then the photon can never 
register in Q. This is our fundamental counterfactual effect: we can say that the 
photon can be registered in Q because if the photon would have gone along the 
lower path, it would have been detected, even though it did not, in fact, go along 
the lower arm (since it was not seen by Ad). 

We can use this effect for computational advantage as follows. Consider an 
idealised quantum computer which is an isolated physical system containing an 
on/off switch, a set of program/data registers denoted by the state |comp) and 
an output register. The on/off switch is a two-level system with basis states |on) 
and |off) and the output register is a two-level system with basis states |0) and 
|1). The program/data registers are set up (“programmed”) to solve some given 
decision problem together with its input (e.g. it might be programmed to test 
for primality together with a given input integer.) The output register, initially 
in state |0) will be set by the computation to |0) or |1) according to the answer 
of the decision problem. The length T of the computation is a known function 
of the input. The time evolution of the computer for time T is given by 

|on) |comp) |0) ^ |on) |comp) |r) 

I off) I comp) |0) — *■ I off) I comp) |0) 

Here r = 0 or 1 is the (initially unknown) result of the computation and the 
computation will run only if the switch is set to “on” . The result is written into 
the output register and all program/data registers are returned to their initial 
state. 

Heuristically we will relate this scenario to the interferometer as follows. Ad 
is the quantum computer with |Mq) and |Mi) being the states |0) and |1) of the 
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output register. The photon is the on/off switch and the two paths are delayed 
by a time T for the photon to eventually arrive at H2. Thus if r = 0 the running 
of the computation makes no distinction between the paths and the photon is 
always seen va. T . li r = 1 the computation (if it ran) would distinguish the 
two paths and we will see the photon at Q with probability As before, with 
probability ^ the photon will register at Q (so that we are sure that r = 1) and 
the output register will be seen to be in state |0). Thus the computation has not 
run, yet we have learnt the result! 

More formally in terms of states of the computer, we first set the on/off 
switch to the superposition: 

Icomp) 10) (6) 



and then allow time T (the computation time) to elapse yielding the state 
1 



V2 



(I off) I comp) 



Next rotate the state of the switch by 



1 



|off) ^ 



I on) I comp) |r)) 



|on) ^ -^(loff) - |on)) 



(7) 



This yields the state 



1 / doff) + |on» doff) - l°°» 



72 V 72 



72 



1 

72 




(|o) + k)) 

72 



+ I on) 



(|o) - 17) 
72 




(8) 



Here r = 0 or 1 according to the (as yet unknown) result of the computation. 
Next we measure the switch to see if it is on or off. Note that if r = 0 then we 
never see “on” and if r = 1 we see “on” with probability 1/2. Suppose that we 
see “on” . Then we know that the result of the computation must certainly be 
r = 1. We then examine the output register which will show |0) with probability 
1/2. If this happens then the computation has not been run (because if it had, 
then the output register must show |1)). Overall, if the result is actually r = 1 
then with probability 1/4 we learn the correct result (and know it is correct) 
with no computation having taken place! 

Note that if the actual solution of the decision problem is r = 0 then we 
will never ascertain this from the above procedure because if r = 0 then the 
output register will always show 0 and the switch will always be finally seen to 
be “off” . But this outcome also arises for r = 1 with probability ^ and we cannot 
a posteriori distinguish the two possible causes. Correspondingly, if the actual 
solution is r = 1 then with probability j will we fail to ascertain this. 

The above description of the process represented by eqs. (0 to (jS|) involves 
some delicate interpretational issues. For example, a many-worlds adherent might 
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object that initially the switch was set in an equal superposition of being on and 
off, so even in the subsequent case of “no computation taking place” the com- 
puter actually did run in another “parallel universe” so we cannot claim to get 
the result for free. One may, to some extent, counter this objection as follows: 
suppose that when the result is really r = 1, the computer is also designed to 
explode at the end of the computation, if it is run. Then using the above pro- 
cedure, in my world I learn that r = 1 and the computer remains unexploded, 
available to do another run. I do not really care if it self-destructs in some “other 
universe” ! 

The counterfactual quantum computation procedure above may be consid- 
erably improved (using a method inspired by the improvements to the Elizur- 
Vaidman problem given in m) to essentially eliminate the deficencies noted 
above. As described below, we will achieve the following: 

For any given e > 0 

(i) If the result is r = 0, we will learn this with probability 1 but some compu- 
tation will have taken place. 

(ii) If the result is r = 1, we will learn this with probability 1 — e with no 
computation having taken place. 

Thus for the many-worlds adherent, the universe in which the computation takes 
place can be made to occur with arbitrarily small amplitude 0(^/F) (in the case 
that r = 1), which considerably weakens his/her/its objection. Recall that many 
basic results in information theory and computer science are formulated in an 
asymptotic framework which allows an arbitrarily small failure of some desired 
property. This occurs for example in the distinction between the complexity 
classes P and BPP jIS| (the latter allowing an arbitrarily small probability of 
a false result) and Shannon’s source coding theorem having not perfect fidelity, 
but fidelity 1 — e (for any e > 0) for the signals reconstructed from their coded 
compressed versions. Thus if some undesirable result can be made to occur with 
arbitrarily small (although non-zero) probability then FAPP it may be ignored. 

Pg 

The improved counterfactual scheme exploits the so-called quantum watched 
pot effect (or quantum Zeno effect) and it goes as follows. We note first that the 
state |comp) will never become entangled with the other registers so we omit it, 
writing the action of the computer as 

|off) |0) ^ |off) |0) 

jon) |0) ^ jon) |r) ' ' 

Choose an angle ct = -^ for N sufficiently large (c.f. later). Then perform the 
following five operations: 

(a) Rotate the switch by angle a. 

(b) Allow the running time T to elapse. 

(c) Read the output register. If it shows 0 then continue. If it shows I then 
discard the state and start again from the beginning. 



Quantum Effects in Algorithms 111 



Remark, (a) and (b) will result in the evolution 

|off) |0) ^ (cos a |off) + sin a |on)) |0) ^ cos a |off) |0) + sin a |on) |r) (10) 

If r = 0 then the output will always show 0 and (c) will result in the state 
(cos a |off) + sin a |on)) |0) with probability 1. If r = 1 then (c) will result 
in the collapsed state |off) |0) obtained with (high) probability cos^ To 
complete the procedure we: 

(d) Repeat (a), (b) and (c) a further — 1 times. 

(e) Finally measure the switch to see if it is on or off (assuming that all stages 
have been kept in (c) and (d)). 

We claim that in (e), if the switch is seen to be “on” then r is certainly 0 (and 
some computation has been done), and if the switch is seen to be “off” , then r is 
certainly 1 and no computation has taken place. In the latter case the probability 
of keeping all stages is (cos^ which tends to 1 as ^ oo. Thus by choosing 
N to be sufficiently large we can make the probability of success greater than 
1 — e for any given e. 

To see that our claim is correct, note that if r = 0 then the switch is just 
successively rotated from |off) to |on) in N stages and it never entangles with the 
output register. If r = 1 then the state is repeatedly collapsed to |off) |0) so that 
no computation takes place in any stage (because if it did, the output register 
would show the result r = 1). Indeed the waiting in (b) acts as a measurement of 
“on” versus “off” for the switch (if r = I) and in this case, we are just freezing the 
switch in its |off) state by frequent repeated measurement. This is the quantum 
watched pot effect. 

Note that according to (i), if r = 0 then this result is not learnt “for free”. 
A natural question is whether or not there is a counterfactual scheme which 
yields the information of either result (r = 0 or 1) with no computation having 
taken place. The procedure described above may be readily modified to provide 
a scheme with the following properties: with probability 1 — e we learn the result 
and for either outcome, be it r = 0 or r = 1, it is obtained for “free” with 
probability We also learn whether or not the produced result was obtained 
for “free” . It remains an open question whether or not each of the two results 
may be obtained for “free” with high probability 1 — e, or indeed, whether the 
sum of these two probabilities can be made to exceed 1. 
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Abstract. In order to design a quantum circuit that performs a desired 
quantum computation, it is necessary to find a decomposition of the 
unitary matrix that represents that computation in terms of a sequence 
of quantum gate operations. To date, such designs have either been found 
by hand or by exhaustive enumeration of all possible circuit topologies. In 
this paper we propose an automated approach to quantum circuit design 
using search heuristics based on principles abstracted from evolutionary 
genetics, i.e. using a genetic programming algorithm adapted specially 
for this problem. We demonstrate the method on the task of discovering 
quantum circuit designs for quantum teleportation. We show that to find 
a given known circuit design (one which was hand-crafted by a human), 
the method considers roughly an order of magnitude fewer designs than 
naive enumeration. In addition, the method finds novel circuit designs 
superior to those previously known. 



1 Introduction: Quantum Circuit Design 

1.1 Quantum Computation 

Quantum computation is an emerging area of study, which considers the pro- 
cessing of quantum information, rather than the familiar classical information. 
The state of a quantum computer is defined as a superposition of qubits. A com- 
putation on such a computer is the unitary evolution of this state, i.e. the action 
of a unitary matrix operator U upon the state \^). More detailed background 
on the framework of quantum information processing may be found in 
and (IS|. 



1.2 Quantum Gates and Circuits 

Much recent work has been devoted to the construction of unitary transforma- 
tions from sequences of more primitive ones. Deutsch ([0]) introduced the notion 
that such simple unitary operators can be thought of as elementary gates per- 
forming logical operations, and more sophisticated operators can be thought of 
as circuits composed of gates, in analogy to the standard formalism for classical 
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Boolean electrical circuits. This is sometimes called the network model of com- 
putation. Following the classical computation line of analysis, in which certain 
small sets of gates (as small as one gate) are known to be sufficient to represent 
all possible circuits, several researchers have proposed such universal gate sets 
(as small as a single parametrized gate family) for quantum circuits ( u, m)- 
Besides the identification of such sets, some attempts have been made to char- 
acterize the minimal number of gates drawn from a given universal set required 
to implement a given operator U ([7]). 



1.3 Circuit Design 

Now assume we would like build a circuit to implement a certain computation, 
represented by U . Most likely our mechanisms for manufacturing quantum com- 
puters will begin with allowing us to implement certain very specific primitive 
quantum operations more effectively than others, for a variety of reasons which 
will be peculiar to the technology. Given that we have a reasonable set of gates 
from which to select circuit elements, and perhaps some theoretical ammunition 
regarding the minimum number we will need, we are still left with the following 
practical question: What is a specific sequence of those gates that will implement 
the operation? After we have an efficient and ffexible method for answering this 
question, we will want to answer the following: What is a specific sequence of 
those gates that will implement the operation using only the minimum number 
of gates necessary? As the enterprise of building quantum circuits matures, we 
may eventually wish to find circuits meeting other measures of optimality aside 
from parsimony. This paper presents a solution to the first (and most important) 
problem, which also indirectly addresses the issue of parsimony by allowing the 
size of the circuits considered to vary. 



2 Searching the Space of Circuit Designs 

2.1 Automated Circuit Design 

In this paper we are concerned not with the theoretical analysis of minimality of 
representation, but rather with the practical automated discovery of a correct 
circuit for a target unitary transformation U. We characterize the problem as a 
search over the space of possible circuit designs. We focus foremost on demon- 
strating a search algorithm which finds a correct circuit in less time than it 
would take to try every possibility. Parsimony of representations will be encour- 
aged through the thoughtful definition of heuristics in the search procedure. It 
is useful to state here that to avoid exhaustive enumeration, we give up any 
worst-case guarantee of finding a correct circuit design; so far this is the state of 
the art in combinatorial optimization (0)- 
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2.2 The Search Space 

There are two components to a quantum circuit design. One is the topology of 
the circuit - the gate elements and the connections between them. This is a 
discrete entity. An important complication enters when we wish to allow topolo- 
gies to have different sizes, i.e. numbers of gates, which we would prefer to leave 
unspecified when automating circuit design, leaving the algorithm to find the 
appropriate size. The second is the assignment of angle values within the gates, 
if applicable; when our gate selection set includes gates which are actually para- 
metric families of gates, there are continuous parameters to be found. 

The paper of Di Vincenzo and Smolin ([Zj) discussed numerical optimization 
for the discovery of parameters for two-qubit gates, within a fixed circuit topol- 
ogy, which lead to a desired unitary computation. They used this technique to 
show that certain gates of interest (the Toffoli gate and arbitrary three-qubit 
gates) could themselves be represented as circuits of two-qubit gates, by finding 
the necessary two-qubit gate parameters. In order to find the necessary circuit 
topologies, however, all possible topologies were tried. The focus of that paper 
was to show the possibility of decomposing particular computations into circuits 
of simpler gates; thus exhaustive enumeration was sufficient as a tool to prove 
the point. We are interested here in a practical and general method for efficiently 
finding correct circuit topologies for any given operator, in other words avoid- 
ing exhaustive enumeration. We return to the continuous aspect of the search 
problem later in Section 0 

3 Genetic Programming: A Set of Search Henristics 

3.1 Why Genetic Programming? 

Our search problem makes a difficult demand on any search method we might 
think to employ. First, the search method must be amenable to problems in 
which it is difficult to characterize the structure of the solution space exactly. To 
clarify this point, consider that our formulation of the problem leaves the form of 
the target unitary transformation U completely unspecified; no deep knowledge 
of U’s substructure, behavior, relationship to the gates used, or nature otherwise 
can be used to advantage to eliminate invalid possibilities in the search problem. 
This very general stance is appropriate for quantum circuit design since human 
techniques and intuitions about quantum circuits have not reached a mature 
stage yet; once specific classes of quantum circuits can be delineated, it may be 
fruitful to design search methods which take advantage of their extra constraints. 
Furthermore, the quantum circuit design problem is one in which it is difficult 
to evaluate the best next local move to make at any given point in the search; 
the entire solution must then be evaluated in order to evaluate the effect of a 
local change in a circuit candidate. Genetic programming is appropriate in this 
setting since it relies only on evaluations of entire circuits. 

Second, it must be capable of considering solution structures of variable 
length. This is crucial if it is to have any hope of finding small designs; it must 
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be given the latitude to explore solution candidates of different sizes. A par- 
ticular set of search heuristics, the so-called genetic programming method 
has the distinction of being the only search technique having the capability of 
searching over solutions of varying structure and size. Genetic programming is a 
type of genetic algorithm which in turn is a type of stochastic hill-climbing, 
or “go with the winners” algorithm (Q), along with simulated annealing (USD- 
Genetic programming is the kind of genetic algorithm which is concerned with 
non- fixed-length topological structures, rather than the simpler case of fixed- 
length solutions. 

3.2 The Parts of Genetic Programming 

Genetic programming is a simple set of search heuristics based loosely on the 
principles of evolutionary genetics. One of its most distinctive traits is that it is 
a population-based method, or one which maintains multiple solution candidates 
simultaneously, whose ’evolution’ paths may interact with each other. In partic- 
ular, they may trade substructures in an operation called “crossover” , in analogy 
to sexual reproduction. The method is heavily stochastic, sometimes perform- 
ing random perturbations on solution candidates {“mutations”), and greedily 
selecting the current best solutions to continue pursuing via random sampling 
weighted by solution quality {“fitness”, “survival of the fittest”). A typical ge- 
netic programming algorithm has this form: 

Initialize population with random solutions. 

Until the stopping criterion has been reached, 

1. Evaluate the quality of each solution in the population. 

2. Sample from the population, weighted by solution quality, to 
form the ‘breeding pool’ . 

3. For each member of this subset of the population, choose one 
of the following operations to perform on it: 

a. Mutation (choose with probability p(M)) 

b. Crossover(choose with probability p(C); requires a partner) 

Each iteration of the algorithm is called a “generation” . 

Because its directional guidance is based on evaluations of entire solutions, all 
that is necessary to apply the algorithm to a problem is a well-defined measure of 
solution quality; it is thus amenable to problems in which it is difficult to evaluate 
the best local move to make at each partial solution (such as the circuit design 
problem). The main power of the method, which distinguishes it from simple 
stochastic local perturbation, is in the crossover operation. If the problem is one 
in which we expect substructures to contain localized information, i.e. represent 
meaningful subsolutions (an analogy to subroutines of a program is useful here), 
then crossover has a hope of successfully transferring a subsolution to a different 
solution, perhaps increasing its overall quality. In the circuit design problem, it 
seems reasonable to expect that transferable subcircuits exist. Grossover is also 
the main mechanism for obtaining topology candidates of different sizes. 
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4 A Genetic Programming Algorithm for Quantum 
Circuit Design 

For this investigation we designed a genetic programming algorithm tailored 
specifically for the problem of quantum circuit design. 
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Fig. 1. An example circuit. 



4.1 Representation 



Circuit Representation. An anonymous quantum circuit is shown in Figure 
n as an example of the representation we use. It is represented as the following 
nested list data structure, which encodes with each circuit element, its name, 
parameters if any, and embedding (the wires to which it is connected, followed 
by the number of wires in the circuit: three in this case): 




params[], {1; 3} 
,params\\, {2, 3; 3} 

paramsW, {2; 3}'| 
,params[], {1; 3} 

, paramsW, {3; 3} 



V2 

'l 0 0 0\ 
0 10 0 
0 0 0 1 
\0 0 1 0 / 
' { i o' 



, paramsW, {3, 1; 3} 
Q ^ j , paramsW, {1; 



( 1 ) 
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Gate Selection Set. The algorithm chooses gates from a prespecified selection 
set. These gates may have unspecified continuous angle parameters associated 
with them, which must be adjusted by the search algorithm. The gates may 
also be fixed, or parameterless, gates. In a general setting where little is known 
about the target transformation, it is sensible to select the gate set such that 
it forms a universal gate set. It may also be sensible to choose an overcomplete 
set, one which includes a number of gates beyond a computation-universal core 
subset. This may be useful for obtaining more compact representations, yet may 
be more costly than having a smaller number of gate types, depending on the 
technological practicalities of quantum hardware manufacture which hold at the 
time of the design. An undercomplete set may make sense when some known 
properties of the target computation allow it. 

4.2 Evaluation 

Solution Quality Measure. To evaluate the quality of a circuit candidate, we 
compare its matrix form S to the target matrix U using the objective function 



We call / the fitness or the discrepancy, our goal is to find circuits which 
minimize the discrepancy between the circuits in our population and the target. 
When / = 0, we have found a circuit which implements U exactly. Otherwise, 
we have found an approximation to U. 

We regard the most sensible evaluation measure as an open question. A 
paper by Knill El considers several measures, many of which are not practically 
computable, since they take into account all possible states on which the operator 
may act. One requirement of the measure chosen is that it yields a minimum 
(maximum) when S = U' this property is true of all of Knill’s measures. There 
is a degree of arbitrariness in specifying the proper qualitative behavior of the 
metric when S differs from U. 

While a measure such as / allows the discovery of approximate circuits in a 
well-defined way, in this paper we focus only upon unitary operations which we 
can represent exactly. 

4.3 Selection 

Selection is the choosing of a subset from the population to modify in some way. 
Sampling is weighted by a factor derived from a circuit candidate’s discrepancy 
score, in the way described below, and is performed at the beginning of each 
generation. 




i=l 3 = 1 

This is similar to the objective function used in jjj: 




( 3 ) 



Automated Design of Quantum Circuits 119 



A Ranking-Based Scheme. Rather than translate the discrepancy score of a 
circuit into its selection probability such that the latter is directly proportional to 
the score, we instead first order the circuits according to their discrepancies, then 
determine selection probabilities based directly on the resulting rankings. This 
procedure has the effect of desensitizing the process with respect to the exact 
discrepancy distribution, which tends to exhibit extreme ratios between the best 
candidates and the worst ones; we would like to deemphasize such differences in 
order to avoid complete domination of the selection process by a few candidates 
too early in the evolution, which corresponds to entrapment in a local optimum. 



Selection Probability Distribution. The circuits are ranked from 1 to N, the 
number of circuits in the population, 1 denoting the best. Probabilities are de- 
fined with which to select members of the population for breeding (i.e. crossover), 
mutation, and other operations which yield modified solution candidates. We de- 
sire a functional form yielding probabilities of selection which decrease as the 
ranking increases (i.e. gets worse), choosing a quadratic form as a compromise 
between a form yielding a very weak selection effect (which makes the algorithm 
closer to a purely random search) such a linear decrease, and a form yielding 
a very aggressive selection effect (making the algorithm more ’greedy’, or sus- 
ceptible to short-term gains which might cause it to become trapped in a local 
optimum), such as an exponential decrease. 

The probability P{r) of selecting the circuit having ranking r is then ar'^+br+ 
c for some a, b, and c. To determine some values for these variables we set up some 
constraints, namely that P{r) is a true probability, i.e. -I- -|- c = 1, 

that the lowest ranked member is never picked, i.e. aN'^ + bN + c = 0, and that 
the derivative of the probability goes to zero as r goes to N, guaranteeing that 
the probability function is monotonic decreasing. This set of equations yields 
values of a, b, and c such that 



P{r) = — 



6N 



12 



3A-k2iV2 A(1 -3iV-k2iV2) l~m + 2N‘^' 



(4) 



To derive the new generation’s population from the last generation’s mem- 
bers, selection from the described probability distribution is performed N times 
with replacement; note that the population size stays constant and that on av- 
erage circuits are multiply represented in the next generation a number of times 
proportional to their fitness. This process yields the parents which are fit enough 
to draw upon for the various modifications (i.e. search operations) that follow. 

To finish the activity of this generation, each parent is replaced by a new cir- 
cuit resulting from an operation performed on it; the operation to be performed 
on each circuit is chosen from a discrete probability distribution determined by 
the user of the algorithm. 
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4.4 Search Operators 

Mutation. Mutation is the random perturbation of a single gate, chosen uni- 
formly at random from the gates within the operand circuit. In the case of fixed 
gates, i.e. gates without parameters which can vary, the selected gate’s embed- 
ding is changed by uniformly randomly selecting new connecting lines to replace 
the old ones. 



Substitution. Substitution is similar to mutation, but is the replacement of 
an existing gate chosen uniformly randomly from the gates within the operand 
circuit, with another one selected from the gate selection set uniformly randomly. 
Though replacement can be achieved through an appropriate insertion-deletion 
pair of operations, described below, its inclusion as a separate operation allows 
its probability of occurence to be more explicitly controlled. 



Crossover. The circuit resulting from the crossover, or mating, operation is 
obtained by considering two parent circuits, A and B. A split point is chosen 
uniformly randomly somewhere along each of the two parent circuits. The circuit 
resulting from crossover has the first part of the circuit A attached to the second 
part of the circuit B, or the first part of the circuit B attached to the second 
part of the circuit A, each with probability 0.5. Note that crossover allows the 
size of the resulting circuit to change from that of either A or B. 

Transposition. Transposition is an operation obtained by generalizing cross- 
over; its result is also defined by considering two parents A and B. A subcircuit 
is first defined by the selection of beginning and end points in parent A. The 
beginning point is chosen uniformly randomly along the length of A, and the 
end point is chosen uniformly randomly from the region between the that point 
and the end of A. The resulting circuit is found by inserting the subcircuit at a 
uniformly randomly chosen point along the length of parent circuit B. This also 
allows the size of the resulting circuit to change from that of either A or B. 



Insertion. Insertion is similar to transposition, except that only one parent 
need be considered; a randomly constructed sequence of gates is inserted at a 
random point in the parent, resulting in a larger circuit. The beginning and end 
points of a subcircuit of the parent are chosen as described for the transposition 
operator, only so that the length of this subcircuit can be used as the length 
of the random gate sequence to be inserted. This sequence is constructed by 
choosing uniformly randomly from the gate selection set the described number 
of gates. 

Deletion. Deletion is the inverse of insertion, in that a random subcircuit is 
chosen from within the parent; this sequence is deleted from the parent, resulting 
in a smaller circuit. 
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5 Experimental Results: Quantum Teleportation Circuits 

Quantum teleportation has been identified as an important and interesting ap- 
plication of nonlocal effects in quantum mechanics [3- Brassard has presented 
a circuit for the ’send’ and ’receive’ halves of quantum teleportation in This 
circuit is compact, requiring only 4 gates in the ’send’ subcircuit and 6 in the 
’receive’ subcircuit. It is shown in Figure 0 The gate definitions can be found in 
the example circuit shown inland Figure ^ 
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Fig. 2. The quantum teleportation circuit - ’send’ and ’receive’ parts. 



We chose to demonstrate the search algorithm on the computation matrix 
generated by this circuit, primarily for its general interestingness. Its small size 
gives the advantage of tractability in the algorithm experimentation phase. Also, 
because we start with a circuit to obtain the target unitary transform, we know 
that a compact circuit implementation exists for the problem. We can analyze 
the computational resources our search method requires to reproduce the hand- 
designed circuit. As discussed in Section 14.21 using a problem for which an exact 
circuit representation is known to exist for the gate selection set used avoids the 
need to consider the appropriateness of the particular fitness measure being used 
to score inexact circuits. 

5.1 The ’Send’ Circuit 

The algorithm was given the send circuit’s computation matrix and a gate se- 
lection set consisting of L, R, and XOR. 10 runs were performed, each requiring 
a different number of generations to find a correct circuit, as follows: 9, 26, 16, 
10, 31, 11, 20, 55, 36, 50. 26.4 generations were required on average. 

In each case a circuit was found implementing the given computation exactly; 
although most were different from the original human-designed circuit, all had 
4 gates and included at least one each of the L, R, and XOR gates (thus none 
was necessarily any better than the original circuit). The variance of the number 
of generations required to find a zero-discrepancy circuit is large, owing to the 
heavily stochastic nature of the algorithm. 

A population size of 100 circuit candidates was used. This is the number of 
circuit solutions which must be evaluated upon each generation of the algorithm. 
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Thus, on average, about 2,640 circuits are evaluated for this problem before an 
answer is found. 

By comparison to exhaustive enumeration, the number of possible circuit 
topologies for this problem, knowing the number of gates to consider in advance, 
can be simply computed as follows: With 3 circuit lines, there are 3 ways to 
embed the L gate, 3 ways to embed the R gate, and or (f) = 6 ways to embed 
the XOR gate, yielding 3 + 3 + 6 = 12 different choices for each gate possibility. 
If we fix the topology size we consider to 4 gates, there are 12“^ = 20,736 dif- 
ferent possible topologies to consider for this problem, using a naive exhaustive 
approach. Since our search method actually considers circuits of many different 
sizes, a fair comparison would have to take into account every size class of circuit 
up to some fairly high number. Our method considered circuits at least as large 
as 13 gates; note that there are 12^^ > 10^"* circuits having 13 gates! 

We note here that this number does not take into account symmetries and 
other structure in this search problem, several of which are considered in [ 7 ]. 
Even accounting for these effective reductions of the search space, the compu- 
tational advantage of a stochastic approach such as the one proposed is still 
quite significant. Our method may be also be able to take advantage of such 
information for even greater search efficiency. 

Figure 0 shows a typical plot of the average circuit discrepancy over the 
population at each generation for this problem. The dots on the lower portion 
of the graph indicate the discrepancy of the best circuit (s) in the population at 
each generation. 
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Fig. 3. Typical evolution plot. 



5.2 The ’Receive’ Circuit 

Experiments with the ’receive’ part of the circuit demonstrate a further advan- 
tage of this approach to automated circuit design beyond achieving a significant 
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savings in time and computational resources. The flexibility and generality of our 
approach allows the human user to select a gate set of interest and see whether 
interesting circuits using those gates are found by the search technique. This 
type of automated search has the potential to And circuits which are difficult for 
even resourceful and expert human circuit designers to And. This is true espe- 
cially when a large number of gates is involved; however this small but practical 
circuit example illustrates that even modest combinatorial problems are very 
difficult to And optimal answers for, when unaided by computer methods. 

Rather than the original set of gates used in ^ for this circuit, consisting of 
S, T, and XOR, the genetic programming algorithm was given the gate selection 
set used above, consisting of L, R, and XOR. One of the resulting exact circuits 
is shown in Figure 0 Comparing this to the original ’receive’ part of the human- 
designed circuit shown in Figured it is clear that the new circuit is smaller (4 
gates versus 6), and that the overall teleportation circuit is more elegant since 
it requires only 3 types of gates, L, R, and XOR, rather than 5 now that S and 




Fig. 4. An efAcient circuit found by the search method. 



6 Discussion 

6.1 Genetic Programming Search as a Tool 

At the moment, genetic programming’s ability to work with structures of varying 
sizes makes it the only tool available. Its other primary strength is its eAective- 
ness for opaque problems, where search moves are diflrcult to evaluate without 
considering their eflect on the entire solution. Rather disappointingly, however, 
the method’s search heuristics are not well-understood formally. For example, is- 
sues of convergence, estimated run-time, optimal parameter settings, and behav- 
ior dependence on problem context remain empirical issues. Aldous and Vazirani 
provide one way in which to understand genetic algorithms in general, placing 
them with simulated annealing in the class of “go with the winners” algorithms 
(m). However, this framework addresses only the ’survival of the Attest’ aspect 
of genetic algorithms, not the eflect of the crossover operation, which is one of 
the hallmarks of genetic algorithms. While much has been written about genetic 
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algorithms, most analyses have been empirical rather than formal. Genetic pro- 
gramming, dealing with variable-length structures, is also surely subsumed by 
some more general model which can be understood formally - unfortunately this 
has not yet arrived. 

On the positive side, its flexible framework allows the practitioner to plug in 
his or her own heuristics, encoding any prior knowledge of the problem the user 
may have (for example, regarding the size of the circuit or the types of gates to 
use). The specifiable gate selection set allows the specification of only the gates 
available to the user. 

6.2 Extension to Continuous Case 

The proposed search method can be extended to allow the inclusion of contin- 
uous, or parametrized, gates in the gate selection set, as opposed to the fixed 
gates used in these experiments. This capability requires necessitates greater 
computational effort since an optimization must be performed to tune the con- 
tinuous gate parameters of each circuit candidate such that the discrepancy is 
minimized given the circuit’s discrete topology. However, the ability to incorpo- 
rate continuous gates holds the promise of more compact circuit solutions, as 
well as better circuit approximations where necessary. Experiments elucidating 
this approach, as well as several other potentially powerful extensions, will be 
described in future reports. 

7 Conclusions 

In this paper we have formalized the problem of automated quantum circuit de- 
sign as a search problem. We proceeded to propose a search method tailored for 
this problem. We then demonstrated its usefulness by showing that it is com- 
putationally more efficient than naive enumeration. Finally, we demonstrated 
that it is capable of discovering useful circuits even when the number of gates 
considered is small, as exemplified by a novel circuit found by our algorithm for 
quantum teleportation. 
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Abstract. This paper shows how a basic property of unitary transformations can 
be used for meaningful computations. This approach immediately leads to 
search-type applications, where it improves the number of steps by a square- 
root - a simple minded search that takes N steps, can be improved to 

approximately steps. The quantum search algorithm is one of several 

immediate consequences of this framework. Several novel search-related 
applications are presented. 



1. Introduction 

Several interesting problems in computer science can be looked upon as search 
problems. There are two categories of such problems. First, where the search depends 
on data in memory - this is the database search kind of problems. Alternatively, the 
search could he based on a function known in advance - many NP-complete problems 
and cryptography problems can be expressed in this form. For example the SAT 
problem of NP-completeness asks whether there exists a combination of binary 
variables that satisfies a specified set of Boolean equations - this can be looked upon 
as a search of the state space of the binary variables. In cryptography, the well-known 
56-bit DES code (Data Encryption Standard) can be cracked by an exhaustive search 
of 2“ items [BBHT] [Phone]. 

It aroused considerable interest when it was shown that it was possible to improve 
upon the obvious classical bound for exhaustive search by resorting to quantum 
mechanics [Search] [BBHT] - the intuitive reason for this improvement was that 
quantum mechanical systems can be in multiple states and simultaneously explore 
different regions of configuration space. This improved the number of steps by a 
square-root, i.e. a simple minded search that takes N steps, could be improved to 

approximately -//V steps. The quantum search algorithm was derived using the 
Walsh Hadamard (W-H) transform and it appeared to be a consequence of the special 
properties of this transform. Subsequently [Gensrch] showed that similar results are 
obtained by substituting any unitary transformation in place of the W-H transform. 
This means that a variety of unitary transformations could be used in place of the W- 
H transform and this leads to algorithms for several different problems. This paper 
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describes the approach of [Gensrch] and shows how it can be extended to solve 
various structured problems. 



2. Framework 



A function f{x), x = 0, 1, ■■■{N — l) , is given which is known to be zero for all x 
except the single point (x = t), the goal is to find t (t for target). The obvious classical 
technique of searching by looking at the N values of x, one by one, would clearly take 
0{n) steps. 



Assume that we have at our disposal a unitary transformation U that acts on a 
system with N basis states. First map each value of X to a basis state and start with 
the system in the basis state s (s for start). If we apply U to s, the amplitude of 
reaching t is Uf^ , and if we were to make a measurement that projects the system 



into a unique basis state, the probability of getting the right basis state would be 



\UfJ . It would, therefore, take 



Utsf 

V “ 



repetitions of this experiment before a 



single success. This section shows how it is possible to reach state t in only 0 
steps. This leads to a sizable improvement in the number of steps if \UfJ « 1 . 



Denote the unitary operation that inverts the amplitude in a single state x by . 

In matrix notation this is the diagonal matrix with all diagonal terms equal to 1, 
except the xx term which equals —1 . 

Vjc denotes the column vector which has all terms zero, except for the x*^ term 
which is unity. 

Consider the following unitary operator: g = , since U is unitary U~^ 
is equal to the adjoint, i.e. the complex conjugate of the transpose of U . We first 
show that Q preserves the two dimensional vector space spanned by the two vectors: 



and \jj (note that in the situation of interest, when is small, these two 
vectors are almost orthogonal). 

First consider Qv^ . By the definition of Q , this is: - I^U ItUv^ . Note that 
is an NxN square matrix all of whose terms are zero, except the xx term which is 
1 . Therefore 1^ = 1- Iv^vJ & 1^=1 - 2v^vJ , it follows: 

Gv, = -(/ - 2v,vf (/ - 2v,vf )cv, 

= -(/ - 2v,vJ )j-^Uv, + l[l - 2v,vJ )jv, 



( 1 ) 
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—1 T 

Using the facts: U U and = 1 , it follows that: 

(2) Qv, = + 2(7 - 2v^vJ )yv, . 

Simplify the second term of (2) by the following identities: Vj Uv^ = Uf^ and since 
U is unitary, its inverse is equal to its adjoint fthe complex conjugate of the 
transpose) vJu~^Vf =11*^ ■ 

(3) Gv, =v,[^l-4|t/,/ j+27/,,(t/-S) 

Next consider the action of the operator Q on the vector U~^Vi . Using the 
definition of Q (i.e. Q = -I^U~^IfU ) and carrying out the algebra as in the 
computation of Gvi above, this yields: 

(4) q(u~\)^ -I,U-^I,u(u~\)= = I,U~\ . 

T T _1 * 

Writing as I-2v^v^ and as in (3), v^U Vf=Uf^: 

(5) q(u-\)=(u-\)-2v,vJ(u-\)=(u~\)-2uIv, . 

It follows that G transforms any superposition of and into another 

superposition of the two vectors, thus preserving the two dimensional vector space 
spanned by and (3) & (5) may be written as: 









1 


Vi 


-1 


— 


V J 




-1 


U 




-2UI 


1 


U 



It follows as in [BBHT], that if we start with , then after rj repetitions of Q we 
get the superposition ) where = cos(27]|t/,^|) & = sin if 

\Ufs \ « 1 . If rj = — — — , then we get the superposition ; from this with a single 



application of U we can get Vf . Therefore in O 
state and reach the target state t with certainty. 



steps, we can start with the s- 



3. Quantum Operations 

The interesting feature of the analysis of section 2 is that U can be any unitary 
transformation, whatsoever. Clearly, it can be used to design algorithms where {/ is a 
transformation on the qubits in a quantum computer - the object of this paper is to 
present some such applications. Quantum mechanical operations that can be carried 
out in a controlled way are unitary operations that act on a small number of qubits in 
each step. It is possible to design a variety of quantum mechanical algorithms using 
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just a few elementary quantum mechanical operations. Two of the elementary unitary 
operations needed are; the W-H transformation operation and the selective inversion 
of the amplitudes of certain states. 

2 " 



A basic single bit operation in quantum computing is the operation M - this is 



1 



represented by the following matrix: M = —== 

V2 



1 1 
1 -1 



transformed into a superposition: 



M 1 ^ 



, i.e. a bit in the state 0 is 
. Similarly a bit in state 1 is transformed 



M 1 ^ 



^V2’V2^ 

into I ^ I. In a system in which the states are described by n bits (it has 

1^V2 a/2 J 

A = 2” possible states) we can perform the operation M on each bit independently in 
sequence thus changing the state of the system. The state transition matrix 



representing this operation will be of dimension 2” x 2” . Consider a case when the 

starting state is one of the 2” basis states, i.e. a state described by an n-bit binary 
string with some Os and some Is. The result of performing the operation M on each bit 
will be a superposition of states described by all possible n-bit binary strings with the 



amplitude of each state being +2 ^ . To deduce the sign, observe that from the 



definition of the matrix M, i.e. M = —j= 

V2 



1 1 
1 -1 



, that the phase of the resulting 



configuration is changed only when a bit that was previously a 1 remains a 1 after the 
transformation is performed. Hence if 3c be the n-bit binary string describing the 
starting state and y the n-bit binary string describing the resulting string, the sign of 
the amplitude of y is determined by the parity of the bitwise dot product of x and 



y , i.e. (-l)^’^ . This transformation is the W-H transformation [DJ]. This operation 
(or a closely related operation called the Fourier Transformation [Factor]) is one of 
the things that makes quantum mechanical algorithms more powerful than classical 
algorithms and forms the basis for most significant quantum mechanical algorithms. 

The other transformation we will need is the selective inversion of the phase of the 
amplitude in certain states. Unlike the W-H transformation and other state transition 
matrices, the probability in each state stays the same since the square of the absolute 
value of the amplitude in each state stays the same. The following is a realization 
based on [BBHT]. Assume that there is a binary function f{x) that is either 0 or 1. 
Given a superposition over states x, it is possible to design a quantum circuit that will 
selectively invert the amplitudes in all states where f{x)= 1 . This is achieved by 
appending an ancilla bit, b and considering the quantum circuit that transforms a state 
\x,b'j into \x,f{x)XORb'^ (such a circuit exists since, as proved in [Revers], it is 
possible to design a quantum mechanical circuit to evaluate any function f(x) that 
can be evaluated classically). If the bit b is initially placed in a superposition 
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— !=(jo)-|l)), this circuit will invert the amplitudes precisely in the states for which 
V2 

f{x)= 1 , while leaving amplitudes in other states unchanged. 



4. Summary of Applications 



As mentioned in section 1, the search problem is the following: a function 
f{x),x = 0,1. ..(A-1), is given which is known to be non-zero at certain values of x; 
the task is to find one such value. No structure is known for /(x) except for what is 
explicitly mentioned in the specific problems (4.1). ..(4. 7). N is assumed to be a power 
of 2, say N = 2” . There is a one-to-one correspondence between the N values of x 
and the respective states of an n-bit register. States corresponding to values of x for 
which /(x) is non-zero, are referred to as t-states. 

(4.1) Assume that /(x)= 0 everywhere except for a single value of x. This is the 
standard problem of exhaustive search. 

(4.2) As in (4.1), there is a single point, t, where /(x) is non-zero. Some 
information about t is available in the following form - another n bit word, r, is 
given which is known to differ from t in at most k out of the n bits. 

(4.3) There are multiple points (r-states) at which /(x) is non-zero, it is required to 
find any one of these. Some structure to the problem is specified in the 
following form. We are given a certain unitary transformation U & multiple s- 
states so that for any s & any t are the same. The setting of this subsection 
is abstract, (4.4), (4.5) & (4.7) apply the framework of this subsection to 
concrete problems. 

(4.4) As in (4.3), there are multiple points at which /(x) is non-zero, it is required 
to find any one of these. However, unlike (4.3), no further structure to the 
problem is given. 

(4.5) There is a single point t where /(x) is non-zero. Some information about t is 
available in the form of I n-bit strings, each of which differ from t in exactly k 
out of n bits. 

(4.6) f{x)= 0 everywhere except at the unique point x = t, it is required to find t. 
Also, as in (4.3), we are given a unitary transformation U and multiple 5-states. 
However, unlike (4.3), for various 5-states & various t-states are not all 
identical. 

The analysis of section 2 extends the power of quantum search so that it can be 
used with an arbitrary unitary transform U, but only with a single 5 and single 
t-state. (4.3) extends it to multiple states, but in a restricted way. This 
derivation extends to multiple 5 -states. It is still not known how to handle 
multiple r-states that are not exactly symmetric. 
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(4.7) This problem illustrates how the abstract techniques discussed earlier can be 
applied to solve an actual problem. This problem was first discussed by Eddie 
Farhi & Sam Gutmann [Structure]. 

Two functions f{x,y) & g(x) are defined on the domain 
X = 0,1 ,...(a^- 1), y = 0,1 ,...(a^-1), . f(x,y) is zero everywhere except at the 
unique point 8{x) is non- zero at M values of x including x = q 

(hereM«Ai). The problem is to find q & t 2 - Classically this problem 
would take Q.{nm) steps. The algorithm of (4.1), without using the function 
g{x), would take 0{n) steps. The following analysis makes use of the general 

technique of (4.3) to develop an o{JnM ) step algorithm. Several variants of 
this problem are also considered. 

4.0 The Approach 

The following general approach is made use of in each of the next 7 sub-sections - 
(4.1)...(4.7). 

There are N = 2” states, represented by n qubits, the task is to get the system 
into some target state(s) t at which /(x) is non-zero. A unitary transform U and the 
initial state s are selected and is calculated. It then follows by section 2 that by 

^ repetitions of the operation sequence - , followed by a single 

application of U, the initial state s is transformed into the final state t. 



4.1 Exhaustive Search 



Assume that /(x)= 0 everywhere except at a single point t. The object is to find t. 
As mentioned in the first paragraph of the introduction, there are several important 
problems in computer science for which there no solution is known, except exhaustive 
search. 

Solution For the W-H transform, described in section 3, Uf^ between any pair of 
states j & t is ± — ^ . Therefore we can start with any state s and the procedure of 

^JN 



(4.0) gives us an algorithm requiring 0 



^ 1 ^ 



U,. 



steps, i.e. steps. 



In case s be chosen to be the 0 state, then the operation sequence Q = -I-^WIfW 

leads to the standard quantum search algorithm based on the inversion about average 

—1 T 

interpretation [Gensrch] (note that W =1T). To see this write /q as I-2vqV . 
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Therefore for any vector x : —Wlf^Wx = -W\ I - 2vqV_ 






= -x + 2WvqV_Wx: Ut,. 



It is easily seen that WvqV_Wx is another vector each of whose components is the 



same and equal to A where 




(the average value of all components). 



Therefore the t component of-W/QWx is simply: (x; + 2A) . This may be written as 
A + (A-x;), i.e. each component is as much above (below) the average as it was 
initially below (above) the average, which is precisely the inversion about average. 

In case s be chosen to be a state different from 0 , the dynamics is still very 
similar to the standard quantum search algorithm; however, the inversion about 
average interpretation no longer applies. 



4.2 Search when an Item near the Desired State Is Known: 



This problem is similar to (4.1), i.e. /(x)=0 except at the single point t. The 



difference from (4.1) is that some information about the solution, t, is available in the 
following form: another n bit word, r, is specified - t is known to differ from r in at 
most k of the n bits. 

Such a problem would occur in any situation when we had some prior information 
about the solution, this information could come either from prior knowledge or from a 
noisy data-transmission. 

Solution: The effect of the constraint is to reduce the size of the solution space. One 
way of making use of this constraint, would be to map this to another problem and 
then exhaustively search the reduced space using (4.1). However, such a mapping 
would involve additional overhead. This section presents a different approach which 
carries over to more complicated situations as in (4.5). 

Instead of choosing U as the W-H transform, as in (4.1), in this section U is 
tailored to the problem under consideration. The starting state s is chosen to be the 
specified word r. The operation U consists of the following unitary transformation 





, applied to each of the n qubits. Calculating , it follows that 



n-k 



= 11 -- 



Khtrlog 



n — k k n — k 
log— — 



The technique 



described in (4.0) can now be used - as in (4.1), this consists of repeating the sequence 
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of operations , -I^UIfU, d 



^ 1 ^ 



U, 



times, followed by a single application of the 



operation U (note that, as in (4.1), U ^ = U ). 

The size of the space being searched in this problem is approximately 
n\ 

is equal to ^ — . Using Stirling’s approximation: \ogn\~ nlogn- n , it follows 

n — k\k\ 






which 



that log 



n'l 



: n log — k log — ^ — , comparing this to the number of steps required 

n—k n—k 



by the algorithm, we find that the number of steps in this algorithm, as in (4.1), varies 
as the square-root of the size of the solution space being searched. 



4.3 Multiple s & t States with the Same Uf ^ : 



/(x) is non-zero at j3 values of x, i.e. there are f-states. Some structure of the 
problem is specified in the following form. Assume that we have at our disposal a 
unitary transform U and a i-states such that between any t-state and any j-state 



is the same. The object is to find one of the r-states. This is accomplished by 
transforming the system into a superposition so that there is an equal amplitude in 
each of the t-states and zero amplitude elsewhere. After this, a measurement is made 
that projects the system into one of its basis states, this gives a t-state. 

The problem considered in this subsection is abstract in the sense U is an 
arbitrary unitary transformation. (4.4), (4.5) and (4.7) apply this to concrete problems. 

Solution: The approach is similar to the exhaustive search problem of (4.1). 
However, the analysis of section 2 has to be redone with the following three changes: 



(a) 



The starting state instead of being , is the superposition 




a-l 

X 



a=0 



- the 



amplitude in all states is equal to ~^= , and zero everywhere else. Assuming a to 

-Ja 

be a power of lipc = 2“), such a superposition can be easily created by the following 
procedure. Start with an a bit system with all bits in the 0 state. Do a W-H transform 

on the a bit system and then carry out a mapping from the 2“ states to the ^-states. 

(b) The operations invert the amplitudes in all 5-states & all t-states, 

respectively. 

(c) It can then be shown by an analysis similar to section 2, that after 
1 






0 \ 






operations of - I^U followed by a single application of U, the 




134 



L.K. Grover 



system reaches a superposition so that the amplitude is equal in all t-states and is zero 
everywhere else. Note that the number of operations is smaller by a factor as 
compared to the situation when there were single 5-states & single t-states (as in 
(4.1)). 



4.4 Problem 



f{x) is non-zero at p values of x, equivalently there are p f-states - the task is to 
find one of these. 

This is the problem of exhaustive search when there are multiple (p) 



solutions. A classical search would take an average of 0\ 






steps to find a solution. 



This section presents an 0\ 






step quantum mechanical algorithm. 



Solution By the definition of the W-H transform in section 3, W^q for any t is the 
same. Therefore if we choose s as the 0 state, then it follows by (4.3) that after 



0 \ 






repetitions of -I^WIfW followed by a single application of W, the system 



reaches a superposition such that the amplitude is equal in all the t states and zero 
everywhere else. Note the following three points regarding this scheme: 

As in (4.3), the operation inverts the phase for all p t-states. 

The above implementation requires p to be known in advance. 

The search time is ^J~p faster than the exhaustive search algorithm of (4.1). 

It is necessary to choose s as the 0 state, this is different from (4.1) where j 
could be arbitrary. 



4.5 Problem 

y(x)= 0 except at the single point t. Some information about t is available in the 
form of a n-bit strings, each of which differs from t in exactly k bits. 

This is in some sense the dual of (4.4). In that case there were multiple t- 
states but a single 5-state, while in this problem there are multiple 5-states and a single 
t-state. This kind of problem could occur in extracting a signal out of multiple noisy 
transmissions. 

Solution Let the a specified states be the 5-states. Initialize the system to a 
superposition of these states by the process described in (4.3)(a). After this, apply the 
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unitary transform U which applies the following unitary operation 



1 -^ 



1 -^ 



n—k 



to 



each 



quhit. 



As 



in 



(4.2), 



= 11 -- 



and 



L, I n, n-k A:, n-k ^ ... . . .-rr 

\UfA = — log log tor all 5-states. Also, since each ot the j-states differ 

2 n n k 

from t in exactly the same number of bits implies that has the same sign for all 5- 

states. The framework of (4.3) can now be used - this yields an algorithm that is -Ja 
times faster than that of (4.2). 



4.6 Multiple s-States & a Single f-State Such that Uf^ between Various s-States 
f Is Not Identical 

f{x)= 0 everywhere, except at the single point t. As in (4.3), some structure to the 
problem is specified via U. Assume that we have at our disposal a unitary transform U 
and various 5-states are specified. However, unlike (4.3), between various 5 and 
various t-states are not exactly equal. This case is qualitatively different from all of 
those considered so far because the analysis of section 2 or the modified analysis of 
(4.3), does not directly apply. 

This extends the noisy data-transmission problem of (4.5), to the case where 
there are multiple states specified that are close to the solution state but differ from it 
in varying number of bits ((4.5) required each of the given states to differ from the 
solution in exactly the same number of bits). 

Solution Assume the number of 5 -states to be a , further assume that a is a power 

of 2, i.e. a = 2‘^ . Consider V to be a unitary matrix which is a product of 3 simpler 
unitary matrices, i.e. V = V 1 V 2 U and V~^ = U~W 2 ^Vi^ . 

is a W-H transformation on a bits and a states where a = 2“ , 

V 2 maps the a = 2‘^ states generated by V\ onto the respective 5-states in N- 
dimensional state space, 

U is the available unitary transform on the N states. 

Let the initial state be the 0 state. As a result of V-^2 > amplitude in each of the 

5q states: 5g,5j ...5(^_j , becomes — ^ ; after U, the amplitude in the t-state is 

Va 
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a-l 



. By (4.0), it follows that after ^ r t/ G 



yfa 



a=0 



a-l 

a=0 



repetitions of g = -IqV IfV 



followed by a single application of V, the amplitude in the target state becomes 0(1). 

Many of the problems in the previous subsections can be seen to be particular cases 
of this. For example in case (7^^ are all equal, say to u, then the number of iterations 



becomes: = . The algorithm and bound of (4.5) immediately follow from this. 

ImIvoj 



4.7 Two Dimensional Search 



Two functions f{x,y) & g{x) are defined on the domain x = 0, 1, ...(A^- l), 
y = 0, 1, ...{N — I). f{x,y) is zero everywhere except at the unique point (h,t2)> 
g{x) is non-zero at M values of x including x = t^ (here M « N ). The problem is to 
find q & . 

Classically this problem would take Q.{nm) steps. The algorithm of (4.1), without 

using the function g(x) would take 0{n) steps. This section presents an o{yJ NM ) 
step algorithm. For a different analysis, along with a proof that the algorithm of this 
section is within a constant factor of the fastest possible algorithm, see [Structure]. 

Several variations of this problem are also briefly considered, these demonstrate 
how the techniques discussed in this paper can be applied to real problems. 

Solution First consider the function g{x), x = 0, 1, ...(A^- l) . By executing the 

algorithm of (4.4), with the f-states as the non-zero values of g(x), it is possible for 
the system to reach a superposition such that at each point at which g(x) is non-zero. 



the amplitude is , — and the amplitude is zero everywhere else. This is 

Vm 



accomplished by a sequence of of 




unitary transformations (4.4) - denote this 



V / 

composite unitary operation by {/j . Next keep the value of x the same and carry out 
the algorithm of (4.1) on the N values of y with the t- state corresponding to the non- 
zero value of the function /(x, y). This consists of a sequence of 6>(Viv) elementary 
unitary transformations, denote this composite unitary operation by {/ 2 . It follows 
from (4.1), that as a result of this operation sequence, in case the particle is at x = tj , 
it is also at y = t2. The amplitude of the system being in the desired state is therefore 
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Vm 



. By means of the unitary transformation U = U 1 U 2 , the system starting from a 



certain initial state reaches the desired state with an amplitude of 



1 



Vm ’ 



r-l 



Since {/ is a sequence of elementary unitary operations, it follows that U~ ' is a 
sequence of the adjoints of the same operations in the opposite order and can hence be 
synthesized. By applying the procedure of section 2, it follows that by repeating the 

sequence of operations Q = , o(-\/a^) times, the system reaches the target 

state with certainty. 

The total number of steps is given by the number of repetitions of Q (i.e. 



times the number of steps required for each repetition, (i.e. 0 



■ o{Jn)) 



which 



gives 



o(Vm)> 



o 



w 






- 6>(Viv)= 6>(Viw ). 



The above analysis easily extends to more general cases. For example consider the 
case, where the search- space is rectangular instead of square, i.e. the number of 
possible values for x is and for y is N 2 , The number of steps, instead of being 






now becomes 



o{yjN 2 M ). Alternatively 



consider the case where 



there is an rj dimensional space with the same number of points (n) in each 
dimension. Instead of just f{x,y) & g{x), there are now rj functions: 
f{xi,X 2 ,...x^}, gi(xi,X 2 ,...Xj,_i), g2{xi,X2,...Xj^_2), . . . , gr,-i{xi) with analogous 
definitions. It follows by a similar approach that the number of steps is now 

Another variation is when the function f(x,y) is non- zero at multiple points, say 
p points, and it is required to find one of these. In case each of the p points has a 
different value of x, the framework of (4.3) applies. Considering the unitary 



transformation U as UiU 2 , leads to an algorithm that requires 0\ 



NM 



steps. 



It is not clear how to derive an algorithm when some of the p points have the 
same value of x. 



5. Conclusion & Further Work 

[Search] demonstrated how to make use of quantum mechanical properties to 
develop exhaustive search kinds of algorithms, i.e. algorithms for problems that 
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lacked any structure. [Search] used subtle properties of a particular quantum operation 
called the W-H transform. Subsequently [Gensrch] extended this so that other 
quantum operations could be used instead of the W-H transform. 

Most interesting problems in computer science are concerned with the 
structure of problems and how to develop algorithms to take advantage of this 
structure. [Hogg] has previously suggested heuristic quantum mechanical algorithms 
for structured problems. This paper has given several examples of structured 
problems and how search-type algorithms can be extended to solve these ((4.2) 
through (4.7)) - quantitative closed form bounds were derived for the running time of 
these algorithms. The extensions have shown how to deal with the situation where 
there are multiple i-states (initial states) and a single t-state (target state) ((4.5) and 
(4.6)). Also, it is possible to deal with multiple t-states provided they are exactly 
symmetric ((4.3) and (4.4)). The next step would be to obtain a general algorithm with 
multiple f-states. 
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Abstract. Grover’s algorithm for quantum searching of a database is 
generalized to deal with arbitrary initial amplitude distributions. First 
order linear difference equations are found for the time evolution of the 
amplitudes of the r marked and N — r unmarked states. These equa- 
tions are s olved exactly. An expression for the optimal measurement time 
T ~ 0{^yj^Jr) is derived which is shown to depend only on the initial 
average amplitudes of the marked and unmarked states. A bound on the 
probability of measuring a marked state is derived, which depends only 
on the standard deviation of the initial amplitude distributions of the 
marked or unmarked states. 

Keywords: Quantum searching, Grover’s algorithm, exact solution. 

1 Introduction 

The power of Quantum Computation (QC) was most dramatically demonstrated 
in the algorithms of Shor, for the polynomial time solution of the factorization 
problem Q , and of Grover for a search which can find a marked element in 
an unsorted database of size iV, in 0{'/N) steps (compared to 0{N) steps on a 
classical computer). The importance of Grover’s result stems from the fact that 
it proves the enhanced power of quantum computers compared to classical ones 
for a whole class of problems, for which the bound on the efficiency of classical 
algorithms is known. This is unlike the case of Shor’s algorithm, since in spite 
of the fact that no efficient classical algorithm for the factorization problem is 
known, there is no proof that such an algorithm does not exist. 

A large number of related results followed Grover’s original paper |2|. Among 
these, the efficiency of Grover’s algorithm was analyzed and compared to the 
theoretical efficiency limit of quantum computers for such benchmark search 
problems as introduced (before Grover’s result |2|) by Bennett et al. The 
algorithm was recently shown to be optimal, i.e., to satisfy the theoretical limit 
0. Further developments include the use of Grover’s algorithm or slightly mod- 
ified versions of it as the essential step in algorithms that solve a variety of other 
problems such as quantum search for the median pf] and the minimum P| in a 
set of N items, as well as the collision problem 0 . It was also shown that other 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 140- nT71 1999. 
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search problems which classically require log 2 N evaluations (queries) of a black- 
box function, can be reduced to a single query using Grover’s approach rarni . 
Finally, it was shown that a simple closed formula describes the time evolution of 
the amplitudes of the generalized problem, which includes several marked states 
m As this work is directly relevant to ours, we briefly summarize some of its 
pertinent results. 

Let k(t) [l(t)\ denote the amplitude of the marked [unmarked] states in the 
database, r the number of marked states, and lo = 2 arcsin(-\/r/iV). It was 
shown by Boyer et al. CD that after t steps of the algorithm, the marked states’ 
amplitude increases as: k(t) = sin[w(t + 1/2)]/^^, while at the same time that 
of the unmarked states decreases as: l(t) = cos[w(t + l/2)]/y/N — r. Since N is 
large, the optimal time to measure and complete the calculation is thus after 
T = 0(y/N/r) time steps, when k(t) is maximal. This analysis relies on the 
fact that the initial amplitude distribution is uniform. However, in a variety of 
interesting cases it would be desirable to apply Grover’s algorithm to a non- 
uniform initial distribution. Generically, this could arise in situations where the 
search is used as a subroutine in a larger quantum computation, and the input to 
the algorithm can thus not be controlled. Another example would be cases where 
the given initial distribution over the elements is intrinsically non-uniform. 

In this paper we generalize Grover’s algorithm to the case in which the initial 
amplitudes are either real or complex and follow any arbitrary distribution. The 
time evolution of the amplitudes is solved exactly for general initial conditions, 
and the efficiency of the algorithm is evaluated. It is found that for generic 
initial conditions, the search algorithm still requires 0(yjN/r) steps, with only 
a constant factor compared to the case of a uniform initial distribution [ 3 ]. 

The paper is organized as follows. In Sec. |2| we define the modified Grover 
algorithm and derive difference equations for the time evolution of the amplitudes 
in it. We solve these equations exactly in Sec. 01 and analyze the results in Sec. 
0 A summary and conclusions are presented in Sec.0 



2 The Recursion Equations 

2.1 The Generalized Algorithm 

Our modified algorithm is essentially Grover’s original algorithm, but without 
the initialization step. It thus consists of the following stages: 

1. Use any initial distribution of marked and unmarked states, e.g., the final 
state of any other quantum algorithm (do not initialize the system to the 
uniform distribution). 

2. Repeat the following steps T times [an expression for T is given in Eq. <E3)]: 

A. Rotate the marked states by a phase of tt radians. 

B. Rotate all states by tt radians around the average amplitude of all states. 
This is done by applying the “inversion about average” operator, repre- 
sented by the following unitary matrix: 
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D 






3. Measure the resulting state. 



T! 3 

F - 1 if * = J 



2.2 The Dynamics 

We will now analyze the time evolution of the amplitudes in the modified algo- 
rithm with a total of N states. Let the marked amplitudes at time t be denoted 
by fci(t), i = 1, . . . , r and the unmarked amplitudes by i = r + 1, . . . , N, 
where the initial distribution at t = 0 is arbitrary. Without loss of generality we 
assume that the number of marked states satisfies 1 < r < N/2. We denote the 
averages and variances of the amplitudes by: 



marked: 



unmarked: 



kit) = ^E^fW 

7 — 1 


7 — 1 


(1) 


1 ^ 

'(*) = IvVT S '40 


1 ^ 

= E m-m? 


(2) 



j=r+l j=r+l 



The key observation is that the entire dynamics dictated by Grover’s algorithm 
can be described in full by the time-dependence of the averages. (The variances 
are defined above for convenience, as they are used later in a different context - 
see Section E3) Formally, let: 



C{t) = 



2 

iV 



N 



E 

j=l j=r+l 



^ [{N - r)l{t) - rk{t)] . (3) 



C{t) is thus the weighted average over the marked and unmarked states, with 
the minus sign accounting for the tt phase difference between them during the 
algorithm iterations. The following theorem then shows that all states evolve 
equally: 



Theorem 1. The time evolution of all amplitudes (of both marked and un- 
marked states) is independent of the state index, and satisfies: 



ki{t 1) = C{t) ki{t) i = l,...,r (4) 

hit + 1) = C{t) - hit) i = r + l,...,N (5) 



Proof. - This follows directly from the algorithm. Consider any marked state 
fci(t); this state is flipped to fc'(t) = —ki{t), so that the marked average becomes 
k'{t) = i ^j{i) = —k{t). The unmarked states, on the other hand, do not 

flip, so that the total average after the flip is: x{t) = k'{t) {N — r) l{t)] = 

C{f)/2. “Inversion about average” is by definition: fc'(f) ^ 2x{f) — k[{f) and 
hit) —> 2x(t) — hit). Therefore in total: hit) C it) hit) and hit) C(t) — 
hit). □ 
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From this it follows by averaging that: 

Corollary 1. The average marked and unmarked amplitudes obey first order 
linear coupled difference equations: 



k{t + l) = C{t) + k{t) (6) 

I{t + l) = C{t)-I{t). (7) 

These equations can be solved for k{t) and l{t), and along with the initial distri- 
bution this yield the exact solution for the dynamics of all amplitudes by using 
Eqs. ® and (jni). 



3 Solution of the Recursion Equations 

The recursion formulae can be solved by a standard diagonalization method for 
arbitrary complex initial conditions. Let: 

v(t) = {k{t),I{t)) , 



and define: 



N-2r , 2{N-r) 2r 

a = , b = , c = — . 

iV ’ TV ’ N 

The recursion equations (0) and (0 can be written as: 

v(t + l) = A.v(t), . 

Diagonalization of A yields a solution for v(t), as follows. Let S be the diagonal- 
izing matrix: 

A^ = S-1AS = (^q-®^^, A±=76±*“. 

Then w(t) = S~^ • v(t) satisfies: 

w{t + l) = A^ • w(t) , 

with solution: 

w(t) = ((A_)‘u;_(0),(A+)‘w+(0)) 

where w(0) = (ic_(0), ■u;+(0)). This yields k{t) and l{t) from v(t) = S • w(t). 
Diagonalizing A one finds: 

'y = + be = 1 

r 

cos Lo = a = 1 — 2 — , 



( 8 ) 

(9) 
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which is identical to the frequency found by Boyer et al. HD The eigenvectors 
of A are the columns of S: 



S = 






N 



= 



2 \ N-r 2 



2 \ N-r 2 



Using this: 



fern) = = 



2 \/ T^H^) + 5 ^( 0 ) 



so that: 



v(t) = S ■ 



This yields finally, after some straightforward algebra: 



— iujt 
iujt 



k{t) = fc(0) cosuit + Z(0) 
l{t) = l{0) cos Lot — fc(0) 



N-' 



N- 



sin wt 



■ sin wt. 



( 10 ) 

( 11 ) 



Together with Eqs. 0) and (0 this provides the complete exact solution to the 
dynamics of the amplitudes in the generalized Grover algorithm, for arbitrary 
initial conditions. 



4 Analysis 

Next we derive several properties of the amplitudes. 

4.1 Phase Difference 

The averaged amplitudes can be expressed concisely as follows (even when A(0) 
and ^0) are complex): 



where 



k{t) = asin(wt + cj)) 
l{t) = f3cos{ujt + (j)) 



( 12 ) 

(13) 



tan (j) = 



fc(0) I r 

N-r' 



r 

p^ = j{of + k{oy^ 



(14) 



which shows that there is & tl j 2 phase difference between the marked and un- 
marked amplitudes: when the average marked amplitude is maximal, the average 
unmarked amplitude is minimal, and vice versa. 
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4.2 Constant Variance 

Subtracting Eq. 0) from Eq. and subtracting Eq. o from Eq. (0 , one 
finds: 



ki{t + 1) — k{t + 1) = ki{t) — k{t) (15) 

h{t + 1) — l{t + 1) = (16) 

This means that: 

Aki = ki{t) — k{t) and Ali = — T{t)], (17) 

are constants of motion (time-independent). It follows immediately from the 
definition that the variances cr^ and af (cf. Eqs. (eq:marked) and (eq:unmarked)) 
too, are both time-independent. 

This allows us to simplify the expression for the time dependence of the 
amplitudes: 



ki{t) = k{t) + Aki (18) 

k{t) = I{t) + {-iyAk, (19) 

where Aki and Ali are evaluated at t = 0. 



4.3 Maximal Probability of Success and Optimal Number of 
Iterations 

The probability that a marked state will be obtained in the measurement at time 
t at the end of the process is P{t) = ^ bound on this quantity can 

be derived as follows. Since all the operators used are unitary, the amplitudes 
satisfy the normalization condition: 



r N 

Y,\h{t)\^+ ^ Ut)\^ = l (20) 

2=1 i—r-\-l 



at all times. Using (y — y)^ = y^ — y'^ {y is a random variable), we find from 
Eq. ( 0 : 

N N 

^ mt)|2 = (iV-r)af-k| ^ k{t)\V{N-r). 

2=r+l 2=r+l 

Let: 



Pmax =1-{N - r)af, 



( 21 ) 
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a time-independent quantity. Note that in the case of uniform initial distribution 
of amplitudes af = 0 and Pmax = 1- Now, P{t) = Pmax — {N — r)\l{t)\'^, so that: 

p{t) < Pmax (22) 

is the required bound. Using the exact solution, we can show that the Pmax 
bound is in fact tight. For, from Eq. (1 1 ,411 it follows that 1{T) = 0 when: 



u;T + 0=(j + l/2)7r, j = 0,l,2,... (23) 

At these times the bound is reached so that times T satisfying Eq. (12.411 are 
optimal for measurement. Note that this conclusion holds only if fc(0)//(0) is 
real. When fc(0)//(0) is complex, the bound is generally not reached since l{t) 
may never vanish. Collecting our results: 

Theorem 2. Given arbitrary initial distributions of r marked and N — r un- 
marked states, with known averages fc(0) and /(O) respectively, k{0)/l{Q) real, 
the optimal measurement times are after: 



T = 



{j + 1/2)7t - arctan ^ 



arccos (l — 2^) 

steps, when the probability of obtaining a marked state is Pi 

Eq. !&1\) . 



j = 0,l,2,... (24) 

as given by 



An important conclusion is that to determine the optimal measurement 
times, all one needs to know are the average initial amplitudes and the number 
of marked states. The more difficult case when these are unavailable will be con- 
sidered in a separate publication na. The expansion of Eq. ll24ll in r/N <C 1 (at 
j = 0) yields: 



T=-\^ + \^/Wr~Y^^MN + 0{rlN), (25) 



confirming that Grover’s algorithm converges in 0{^J N /r) steps for arbitrary 
distributions. The advantage of an initial amplitude distribution with a relatively 
high average of the marked states is manifested in the constant offset 
which may significantly reduce the required number of steps. 



5 Summary and Conclusions 

In this work we generalized Grover’s quantum search algorithm to apply for 
initial input distributions which are non-uniform. In fact, it was shown that by 
simply omitting the first step of Grover’s original algorithm, wherein a uniform 
superposition is created over all elements in the database, a more general al- 
gorithm results which applies to arbitrary initial distributions. To analyze the 
algorithm, we found that the time evolution of the amplitudes of the marked 
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and unmarked states can be described by first-order linear difference equations 
with some special properties. The most important of these is that all amplitudes 
essentially evolve uniformly, with the dynamics being determined completely by 
the average amplitudes. This observation allowed us to find an exact solution for 
the time-evolution of the amplitudes. An important conclusion from this solu- 
tion is that generically the generalized algorithm also has a O^y'N/r) running 
time, thus being more powerful than any classical algorithm designed to solve 
the same task. 

This work was initiated during the 1997 Elsag-Bailey - I.S.I. Foundation 
research meeting on quantum computation. 
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Abstract. In this paper we give a quantum mechanical algorithm that 
can search a database by a single query when the number of solutions 
is more than a quarter. It utilizes the generalized Grover operator of 
arbitrary phase. 



1 Introduction 

For N G Nlet = {0,l,...,iV — 1} denote the additive cyclic group of order 
N and let F" : Z;v ^ ^2 be the Boolean function computed by the oracle. The 
database search problem is to find some j G Zn such that F{j) = 1 under the 
assumption that such an j exists. We assume that the structure of F is unknown 
so that it is not possible to obtain a knowledge about F without evaluating it 
on Z^v. 

Grover ^ constructed a quantum mechanical algorithm that can solve the 
database search problem in expected time of order 0{y/N/t) where t = \{j G 
Zn : F{j) = 1}| 0 . Bennett et el. 0 have shown that the work of Grover is 
optimal up to a multiplicative constant. Especially when t = A/4 is known, the 
original Grover algorithm in 0 can search a solution only by a single query 0 . 
It uses the 7r-phase, i.e., marking the states by multiplying = —1. When t = 
A/2, by changing this phase to 7t/2, that is, by marking the states by multiplying 
gTrz /2 _ j modifying the corresponding diffusion transform according to this 
phase, the solution can be found with certainty after a single iteration 0. 

In this paper we generalize Grover algorithm in order to permit arbitrary 
phase. When t > A/4, we give generalized conditional phase and diffusion trans- 
form depending on t and then formulate a quantum mechanical algorithm that 
solves the database search problem in a single query. 

2 Generalized Grover Operator 

Let Bn = {|«)}aGZw be the standard basis of an n-qubit quantum register with 
A = 2" and Hn be the corresponding Hilbert space which represents the state 
vectors of the quantum system. Let I be any element in Z n- 

For 7 G K the conditional 'y -phase transform : Hn — > BIn is defined by 

Sf.,|j) = 

for j = 0, 1, . . . , A — 1. Let denote where F)(j) = Sji. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 148-|i^^ 1999. 
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Let W; be any unitary transformation on satisfying 



Will) 



1 

7^ 



N-1 



Y. Ii> ^ 



For example, we may set Wq to be the Walsh-Hadamard transform. When N is 
not a power of 2 the approximate Fourier transform in can be used. 

For P gM. the P-phase diffusion transform T>f 3 : Hn ^ Td-N defined by 

^0 = 

= I + (e*^-l)P, 



where I is the identity matrix and P = (Pjk) is a projection matrix vciT-Ln with 
Pjk = 7 . Notice that the definition of D,g does not depend on 1. We note that 
Sf_j and Dp are unitary. 

Let us define the generalized Grover operator of {P,^) -phase Gf./ 3,7 • 

Hn by 



G_f,/3,7 — DpSp,-f 



When /3 = 7 we set G_f ^7 = Gf,-/,-/ 
Let 



A={j GZN\F{j) = l} and S = {j S Zjv|P(j) = 0}- 
Then t = |A|. For ko, Iq G C such that t|fcoP + (fV — t)\lop = 1 let us define 

\p{ko, lo)) = ko\j) + Y 

and let kj and Ij be the corresponding amplitudes after j iterations of the gen- 
eralized Grover operator of (/3, yj-phase; 

Then we have the following lemma. 



Lemma 1. For a nonnegative integer j after applying j -b 1 the generalized 
Grover operator of (P,^)- phase to the initial state \ip{ko,lo)), the state becomes 
IPikj+iPj+i)) where 



kj+i — 



-l)t + N 



N 






{e^f^ - 1){N - t) 



N 



Ij+i - 



o*/3 _ 



l)f 



{e^<^ - 1){N - t) + N 



Ij, 



h- 



( 1 ) 



N 



N 
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3 Quantum Database Search 



Theorem 1. Assume that /cq = /3 , 7 € [0, 27t]. Then h = 0 if and only if 

^ < t < N and /3 = 7 = cos“^ (l — . In this case, we have k\ = (e*'*' — l)A:o 

and / 3,7 G [tt/3, Stt/S] . 

Proof. By m we get 



ki 



h 



- l)(e*T' 



l)^ + eH /o. 



ko , 



(2) 



Considering the imaginary part of e the equation 

^ {(1 — cos P) siny + (cosy — 1 ) sin /?} = 0 



is equivalent to 



1 — cos P 1 — cos 7 
sin P sin 7 



(3) 



Considering the real part of e ^^h/lo, it follows from @ that the equation 



^{(l-cos/3)(l 



cos 7 ) + sin /3 sin 7 } — 1 = 0 



is equivalent to 



N 

cos P = cos 7 = 1 — — ■ 
^ ' 2t 



Thus we get /3 , 7 G [tt/3, 57 r/ 3 ], P = "f and t ^ by (0 again. Furthermore, 
by (0 we obtain fci = (e*'’' — l)A;o. This completes the proof. □ 

For the case of 7 r-phase in g, -D^ = -i+2P is an inversion about average 
operation and we have 

-T>^m,i)) = \p (-^fc + ^-^i, + f fc 



In this case, there is an explicit closed- form formula for kj and Ij |2| ; 

J kj = (-l)^-^sin(( 2 j -h 1)6») , 

[ = i-^y -^=^cos{{2j + 1)6) 

for all nonnegative integer j where the angle 6 is defined so that sin^ 6 = ^. 
Especially when t = N/4 we have Zi = 0. 
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Grover operator of |-phase was used in |2|. Since 



D 



.\i^{k,l)) = r/’ ( 



(i— l)i+AT j 
N ' 



N ' 






N 



N 



when t = -y we have 



GF,^\tp{k, k)) = \ip{{i - l)k, 0 )). 

When t G [iV/4,iV] by Theorem Q we have 

GF,-y\ip{ko,ko)) = |'i/'((e*^ - 1)^0, 0)) 

where the phase 7 is defined by 7 = cos“^ (l — . Thus a single iteration of the 

generalized Grover operator makes all the amplitudes of the bases in B vanish 
and we can find a solution by a single query. 
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Abstract. Let a classical algorithm be determined by sequential ap- 
plications of a black box performing one step of this algorithm. If we 
consider this black box as an oracle which gives a value /(a) for a query 
a, we can compute T sequential applications of / on a classical computer 
relative to this oracle in time T. 

It is proved that if T = 0(2"^^), where n is the length of input, then the 
result of T sequential applications of / can not be computed on quantum 
computer with oracle for / for all possible / faster than in time n{T). 
This means that there is no general method of quantum speeding up 
of classical algorithms provided in such a general method a classical 
algorithm is regarded as iterated applications of a given black box. 



1 Introduction 

In the last years many investigators have amassed a convincing body of evidence 
that a quantum device can be more powerful tool for computations than a clas- 
sical computer. This is because for the different problems there exist quantum 
algorithms which find a solution substantially faster than any known (or even 
any possible) classical algorithm (look, for example, at the works 
The latest advance in quantum speeding up is the method of quantum search 
proposed by L. Grover in the work mu. His algorithm takes 0{'/N) time when 
the classical search requires f2{N) time. In some particular cases (look in m) 
the time 0{'/N) for a search can be even reduced. It would be natural to expect 
that some more general method of quantum speeding up can take place for all 
classical algorithms with time complexity more than 0{n). 

One of the main general corollaries from the classical theory of algorithms 
is that if we know only a code of algorithm then in general case the unique 
way to learn a result of computations is to run this algorithm on a given input. 
Therefore, given a code of algorithm, generally speaking we can only use it 
as a black box to perform sequentially all steps of computations and no other 
analysis can yield their result. Thus we can regard a computation Yg — > Xi — > 
. . . — > Xt as iterated application of the same oracle / which gives sequentially 
Y,+i = f{Xs), s = 0, 1, . . . , T - 1, T = T(n) > 0(n). 

In view of this we assume that a general method of quantum speeding up 
of classical algorithms is a quantum query machine with oracle / which yields 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 152- rrCTl 1999. 
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the result Xt of computations in time ct{T), where a{T)/T — > 0 (T — > oo). 
However, we shall see that such a method does not exist. This demonstrates a 
value of every partial result about quantum speeding up because such results 
are all that can be done. 

Oracle quantum computers will be treated here within the framework of 
approach proposed by C. Bennett, E. Bernstein, G. Brassard and U.Vazirani in 
the work iFnmvi . They considered a quantum Turing machine with oracle as 
a model of quantum computer. In this paper we use slightly different model of 
quantum computer with separated quantum and classical parts, but the results 
hold also for QTMs. We proceed with the exact definitions. 

2 Quantum Computer with the Separated Quantum and 
Classical Parts 

Our quantum query machine consists of two parts: quantum and classical. Let 
uj* denotes the set of all words in alphabet ui. 

Quantum part. 

It consists of two infinite tapes: working and query, the finite set U of unitary 
transformations which can be easily performed by the physical devices, and 

OO 

infinite set F = (J of unitary transformations called an oracle for the length 

n—1 

preserving function / : {0, 1}* — > {0, 1}*, each acts on 2^" dimensional 

Hilbert space spanned by {0,1}^" as follows: F„|a,6) = |a, /(o)0&), a, 6 S 
{0, 1}", where 0 denotes the bitwise addition modulo 2. 

The cells of tapes are called qubits. Each qubit takes values from the complex 
1-dimensional sphere of radius 1: {zqO + zil \ zi,Z 2 € C, |zoP + l-^iP = !}• Here 
0 and 1 are referred as basic states of qubit and form the basis of . 

During all the time of computation the both tapes are limited each by two 
markers with fixed positions, so that on the working (query) tape only qubits 
vi,V 2 , ■ ■ ■ ,vt {vt+i,vt+ 2 , ■ ■ ■ , VT+ 2 n) are available in a computation with time 
complexity T = T(n) on input of length n. Put Q = {vi,V 2 , ■ ■ ■ ,VT+ 2 n}- A 
basic state of quantum part is a function of the form e : Q — *■ {0, 1}. Such a 
state can be encoded as |e(ui), e(u 2 ), . . . , e{vT+ 2 n)) and naturally identified with 
the corresponding word in alphabet {0, 1}. Let K = 2’^+^"’; eo, ei, . . . , ck-i be 
all basic states taken in some fixed order, H he K dimensional Hilbert space 
with orthonormal basis eo, ei, . . . , ex-i- "H can be regarded as tensor product 

0 2 0 • • • 0 T^T+ 2 n of 2 dimensional spaces, where Hi is generated by all 

possible values of Ui, i = 1, 2, . . . , T-l- 2n. A (pure) state of quantum part is such 
an element x G H that |a:| = 1. 

Time evolution of quantum part at hand is determined by two types of uni- 
tary transformations on its states: working and query. Let a pair G, U be some- 
how selected, where G C {1,2, ...,T -I- 2u}, U G U is unitary transform on 
2 card(G) dimensional Hilbert space. 

Working transform Wq^u on H has the form E 0 U' , where U' acts as U on 
0 Hi in the basis at hand, E acts as identity on ^ Hi. 
iGO i^G 



154 Y. Ozhigov 



Query transform Quj on Ti. has the form E ^ Ff, where acts as -F„ on 

T+2n T 

Hi and E acts as identity on ^ Hi. 

i=T+l i=l 

Observation of the quantum part. If the quantum part is in state x = 

K-l 

^ XiCi, an observation is a procedure which gives the basic state with prob- 
ability |Aip. 

Classical part. 

It consists of two classical tapes: working and query, which cells are in one- 
to-one correspondence with the respective qubits of the quantum tapes and 
have boundary markers on the corresponding positions. Every cell of classical 
tapes contains a letter from some finite alphabet ui. Evolution of classical part is 
determined by the classical Turing machine M with a few heads on both tapes 
and the set of integrated states of heads: Qw^Qq, Qo, ■ ■ We denote by h{C) 

the integrated state of heads for a state C of classical part. 

Let D be the set of all states of classical part. 

Rule of correspondence between quantum and classical parts has the form 
R -. D — > 2 {b 2 .....T+ 2 n} X u, where VC G R{C) = (G, U), U acts on 
dimensional Hilbert space so that U depends only on h{C), and the elements 
of G are exactly the numbers of those cells on classical tape which contain the 
special letter oq G uj. 

A state of quantum computer at hand is a pair S = {Q{S),C{S)) where Q{S) 
and C{S) are the states of quantum and classical parts respectively. 

Computation on quantum computer. It is a chain of transformations of the 
following form: 

So^ Si^ St, (1) 

where for every t = 0, 1, . . . , T — 1 C{Si) — > C{Si+i) is transformation deter- 
mined by Turing machine M, and the following properties are fulfilled: 

if h{C{Si)) = q^u then Q(5'i+i) = Wfl(c(Si))(0('S'i))> 
if h{C{S,)) = q, then Q(5,+i) = Qu/Q(5,)), 

if h{C{Si)) = qb then i = 0, Q{So) = eo, C{Sq) is fixed initial state, corre- 
sponding to input word a G {0, 1}", 
if h{C{Si)) = qo then i = T, 
in other cases Q{Si+i) = Q{Si). 

We say that this quantum computer (QC) computes a function F{a) with 
probability p > 2/3 and time complexity T if for the computation (^3) on every 
input a the observation of St and the following routine procedure fixed be- 
forehand give F{a) with probability p. We always can reach any other value of 
probability po > p ii fulfill computations repeatedly on the same input and take 
the prevailing result. This leads only to a linear slowdown of computation. 
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3 The Effect of Changes in Oracle on the Resnlt of 
Quantum Computation 

For a state = |si, S 2 , . . . , ST+ 2 n) of the quantum part we denote the word 
st+iSt+2 ■ • ■ ST+n by q{ej). The state S of QC is called query if h{C{S)) = qq. 
Such a state is querying the oracle on all the words q{cj) with some amplitudes. 
Put JC = {{),\, . . . ,K — \}. Let ^ = <3(5') = ^ AjCj. Given a word a S {0, 1}" 

jeK 

for a query state S we define: 

<5a(6= E 

j: q{ej)=a 

It is the probability that a state S is querying the oracle on the word a. In 
particular, ^ = 1- 

aG{0,l}” 

Each query state S induces the metric on the set of all oracles if for length 
preserving functions /, g we define a distance between them by 

ds(/,5) = ( E 

a- f(a)^g(a) 



Lemma 1 Let Quj, Qu^ he query transforms on quantum part of QC corre- 
sponding to functions f, g; S be a query state. Then 

|Qu^(5)-Qug(5)|<2d5(/,ff). 

Proof 

Put £ = {j G /C I f{q{ej)) ff((?(ey))}. We have: IQuy(S') - QUg(S')| < 

2( S ^ 2ds(/, g). Lemma is proved. 

j&c 

Now we shall consider the classical part of computer as a part of working 
tape. Then a state of computer will be a point in dimensional Hilbert space 
Hi. We denote such states by ^,x with indices. All transformations of classical 
part can be fulfilled reversibly as it is shown by C. Bennett in the work |Ei|. This 
results in that all transformations in computation © will be unitary transforms 
in Th- At last we can join sequential steps: Si — > Si+i — > . . . — > Sj where 
Si — > 5'i+i, Sj — > Sj+i are two nearest query transforms, in one step. So the 
computation on our QC acquires the form 

Xo — > Xi — > • ■ • — ^Xt, (2) 

where every passage is the query unitary transform and the following unitary 

Qu j 

transform Ui which depends only on i: Xi — ^ Xi — Xi+i- We shall denote 
t^i(Qu/(6) by Vij{^), then Xi-ei = VijiXi), * = 0,l,...,t- 1. Here t is the 
number of query transforms (or evaluations of the function /) in the computation 
at hand. Put da(C) = \/(5o(6- 
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Lemma 2 If xo — Xi — ^ ■ • • — ^ Xt is a computation with oracle for f, 
a function g differs from f only on one word a € {0, 1}" and xo — > xi — ^ 
. . . — > Xt is O' computation on the same QC with a new oracle for g, then 



t-i 

IXt -X'tl < 2^da(x*)- 

i=0 

Proof 

Induction on t. Basis is evident. Step. In view of that Vt-i^g is unitary, Lemma 
I and inductive hypothesis, we have 

Ixt - Xt\ = \Vt-ijixt-i) - i^t-i,g(xLi)l < 

\Vt-ijixt-i) - Vt-i,g{xt-i)\ + \Vt-i,g{xt-i) - ^^t-i,ff(xLi)l < 

2da(xt-i) + Ixt-i - Xt-il = 2da(xt-i) + 2 X) da(xi) = 2 X da(Xi)- 

2 = 0 2=0 

Lemma is proved. 



4 Main Result 

For a length preserving function / a result of its iteration is defined by the 
induction on k\ is identity mapping, = fif^^^{x)). 

Theorem 1 There is no such QC with oracle for f that for some functions 
t{n),T{n) : t{n)/T{n) — > 0 (n — > oo), T{n) = and every f QC 

computes applying only t{n) evaluations of f. 

Proof 

Suppose that it is not true and some QC with oracle for / computes (0) 

applying only t{n) evaluations of /, where t{n)/T{n) — > 0 (n — > oo), T(n) = 
0(f2PCf and obtain a contradiction. 

Let / : {0,1}* — > {0,1}* be such length preserving bijection that for 

every n = 1,2,... the orbit of the word 0 = 0" contains all words from {0, 1}". 
Let an oracle for / be taken for the computation of /’^^^O) on our QC. This 
computation has the form 0 where t/T — > 0 (n — > oo). Let n be sufficiently 
large so that 5t < T. 

Now we shall define the lists of the form {ft, fi,Ti,Xi) where is a state from 
^ij ICil = Ij /i : {0i 1}* — {0; 1}* is length preserving function, Xi G % C 
{0, 1}" by the following induction on i. 

Basis: i = 0. Put = Xo, /o = /, xo = 0,Tq = {0, 1}". 

Step. Put 

Ci+l “ 1^,/i (C*), 

%+l Ri, Ri = {a \ ^a(^i+i) < 7^}, 

fi+i differs from fi at most on one word Xi where we define Xi+\ = fi+\{xi) 
such that for all s = 1, 2, . . . , T f}_^\{xi) G 
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Note that 2"- — card(Ri) < T“. Therefore we can chose such that Xi+i = 
where j < {i + 1)TT“. It is possible for every f = 1, 2, . . . , t — 1 if a < 5 
and n is sufficiently large, because T = 0(2t). 

We introduce the following notations: Vi = Vij^ , V* = Vij. . Let the unitary 
operator U* be introduced by the following induction: V^{x) = Vo(x), U*(x) = 
and the unitary operator Vi be defined by Vo = Vq, Vi{x) = 

U/(U,_i(x)). Then 6+1 = F*(eo). 

Put Co = Co, e-+i = P*(eo), d. = 16 - a A = |P;(6) - It 

follows from the definition that fi differs from ft at most on the set Xi = 
{xi, Xi+i, . . . , X(_i} where Va G Xi (5a (6) < Consequently, applying Lemma 
1 we obtain 



Z\, < 



2fV2 
2ia/2 ■ 



( 3 ) 



Lemma 3 di < ^k- 

k<i 

Proof 

Induction on i. Basis follows from the definitions. Step: 

a,+i = |u.(6) - PH6)I = |p;(Pz-i(6)) - y{v^-^{^o))\ < 
< |p;(6) - - Pz(e')i = A + 5.. 



Applying the inductive hypothesis we complete the proof. 
Thus in view of (3) Lemma 3 gives 



V* = 1, . . . , t di < 



2*fC2 

J'a/2 



( 4 ) 



It follows from the definition of the functions fi that \/i < t (5xt(6) 
Taking into account inequality 0)> we conclude that for x = xt 

d.(6 - i'i) < d,(6) < d,(6) < d,(6 - 6') + d.(6). 

Hence we have 



d.(6') < 



3f3/2 

J^a/2 * 



< 



1 

-pOC • 



( 5 ) 



Now we can change the value of the function ft only on the word Xt and 
obtain a new function (f such that </fI^I(0) //^^(O). Therefore, if 6 — ^ 

6^ — > • . • — > ft Is the computation of (0) on our QC with oracle for (/>, 
then we have 



\f't-fn>i/y 

On the other hand, Lemma 2 and inequality 



give 



6f5/2 



<2^d,(6')< ^ <1/4 






(6) 



for a > 5 and sufficiently large n, which contradicts to (0. Theorem 1 is proved. 
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If the time complexity of classical computation exceeds we can es- 

tablish a lower bound for the time of quantum simulation as 

Theorem 2 For arbitrary function T(n) there is no such QC with oracle for f 
that for some function t{n) : t^ /T — > 0 (n — > oo) QC computes for 

every f applying only t evaluations of f. 



Proof 

Suppose that 8t < \/T. Let / be selected as above. Put (0) k = 

0,1,..., T. Define the matrix A = (oy) with the following elements: a^- = 

T 

Sfj{xi), i = 0, 1, . . . , t; j = 0, 1, . . . , T. We have for every i = 0, ... ,t Yl 



t T T t 

1, consequently t > Y Y ~ Y Y there exists such rS{0,l,...,T} 

z— Oj— 0 0 z— 0 



t 

that X) — Y- 

i=0 

Changing the value of / only on the word /’’ we obtain a new function g 
where ^ f^^^{0)- Let xo — x'l — ^ ■ • ■ — ^ Xt be computation on QC 

with oracle for g. Then we have 



IXt-x'tl>l/4. (7) 

t 

On the other hand Lemma 2 gives \xt — x't\ ^ ‘^Y \YYr < ‘2\/tYo-iT < 

i=0 

< 1/4 for sufficiently large n, which contradicts to (0). Theorem 2 
is proved. 

Note that the lower bound as for the time of quantum simulation 

follows immediately from the lower bound for the time of quantum search es- 
tablished in the work mm- 
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Abstract. Quantum analog computing is based upon similarity between 
mathematical formalism of a quantum phenomenon and phenomena to be 
analyzed. In this paper, the mathematical formalism of quantum resonance 
combined with tensor product decomposability of unitary evolutions is mapped 
onto a class of NP-complete combinatorial problems. 



1. Introduction 

The competition between digital and analog computes, i.e.., between computations 
and simulations, has a long history. During the last fifty years the theory of 
computation has been based implicitly upon classical physics as idealized in the 
deterministic Turing machine model. However, despite the many successes of digital 
computers, the existence of so called hard problems has revealed limitations on their 
capabilities, since the computational time for solving such problems grows 
exponentially with the size of the problem. 

It was well understood that one possible way to fight the “curse” of the 
combinatorial explosion is to enrich digital computers with analog devices. In 
contradistinction to a digital computer, which performs operations on numbers 
symbolizing an underlying physical process, an analog computer processes 
information by exploiting physical phenomena directly. It is this problem solving via 
direct simulation that allows an analog approach to reduce the complexity of the 
computations significantly. This idea was stressed by Feynman [1] who 
demonstrated that the problem of exponential complexity in terms of calculated 
probabilities can be reduced to a problem of polynomial complexity in terms of 
simulated probabilities. Conceptually, a similar approach can be applied to the whole 
class of NP-complete problems. But is it possible, in general, to find a new 
mathematical formulation for any intractable problem in such a way that it becomes 
tractable? Some experts in computational complexity believe that, in the spirit of the 
Godel theorem, there always exist computational problems such that every 
mathematical formulation that captures the essence of the problem is intractable [2] . 
At this step, we cannot prove or disprove this statement . 
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There are remarkably few (actually three) papers in which quantum analog 
computing is discussed. The first one [3] introduces a hypothetical quantum device (a 
slot machine) for solving a traveling salesman problem. As shown by the author, 
such a device, although intellectually appealing, requires an exponentially large 
number of measurements to get the right answer. The second paper [4] discusses the 
capacity of a hypothetical quantum perception. In the third paper [5], a concept of 
quantum recurrent networks combining quantum conventional networks with 
classical feedback loops was introduced and discussed. 

In this paper an attempt is made to exploit combinatorial properties of tensor 
product decomposability of unitary evolution of many-particle quantum systems for 
simulating solutions to NP-complete problems, while the reinforcement and selection 
of a desired solution is executed by quantum resonance. 



2. Quantum Resonance. 

Consider a quantum system characterized by a discrete spectrum of energy 
eigenstates subject to a small perturbing interaction, and let the perturbation be 
switched on at zero time. The Hamiltonian of the system can be presented as a sum 
of the time-independent and oscillating components: 

// = //q sinftlf Jft) £o«l (1) 

0 ) 

where Hq and are constant Hermitian matrices, (W is the frequency of 
perturbations, and €^Cl)J is the spectral density. 

The probability of a transition from state k to q in the first approximation is 
proportional to the product [6] : 

Pk, 

Here (Pj are the eigenstates of Hq : 

Hq(Pj = Ej(Pj j = l,2---N (3) 

where Ej are the energy eigenvalues, 

ha^^ = E^- E^,k,q = 1,2- --N (4) 






( 2 ) 



and h is the Planck constant. 
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The resonance, i.e., a time-proportional growth of the transition probability 
occurs when CO = : 



_ 






( 5 ) 



3. Combinatorial Problems 

Combinatorial problems are among the hardest in the theory of computations. 
They include a special class of so called NP-complete problems which are considered 
to be intractable by most theoretical computer scientists. A typical representative of 
this class is a famous traveling- salesman problem (TSP) of determining the shortest 
closed tour that connects a given set of n points in the plane. As for any of NP- 
complete problem, here the algorithm for solution is very simple: enumerate all the 
tours, compute their lengths, and select the shortest one. However, the number of 
tours is proportional to n! and that leads to exponential growth of computational time 
as a function of the dimensionality n of the problem, and therefore, to computational 
intractability. 

It should be noticed that, in contradistinction to continuous optimization problems 
where the knowledge about the length of a trajectory is transferred to the neighboring 
trajectories through the gradient, here the gradient does not exist, and there is no 
alternative to a simple enumeration of tours. 

The class of NP-complete problems has a very interesting property: if any single 
problem (including its worse case) can be solved in polynomial time, then every NP- 
complete problem can be solved in polynomial time as well. But despite that, there is 
no progress so far in removing a curse of combinatorial explosion: it turns out that if 
one manages to achieve a polynomial time of computation, then the space or energy 
grow exponentially, i.e., the effect of combinatorial explosion stubbornly reappears. 
That is why the intractability of NP-complete problems is being observed as a 
fundamental principle of theory of computations which plays the same role as the 
second law of thermodynamics in physics. 

At the same time, one has to recognize that the theory of computational complexity 
is an attribute of a digital approach to computations, which means that the monster of 
NP-completeness is a creature of the Turing machine. As an alternative, one can turn 
to an analog device which replaces digital computations by physical simulations. 
Indeed, assume that one found such a physical phenomenon whose mathematical 
description is equivalent to that of a particular NP-complete problem. Then, 
incorporating this phenomenon into an appropriate analog device one can simulate 
the corresponding NP-complete problem. In this connection it is interesting to note 
that, at first sight, NP-complete problems are fundamentally different from natural 
phenomena: they look like man-made puzzles and their formal mathematical 

framework is mapped into decision problems with yes/no solutions. However, one 
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should recall that physical laws can also he stated in a “man-made” form: The least 
time (Fermat), the least action (in modifications of Hamilton, Lagrange, or Jacobi), 
and the least constraints (Gauss). 

In this paper we will describe how to map a combinatorial decision problem into 
the physical phenomenon of quantum resonance on a conceptual level, without going 
into details of actual implementations. 

Let us turn to the property (5) which can be mapped into several computational 
problems, and, for the purpose of illustration, choose the following one: given n 

different items to be distributed over n places; the cost of an p item put in r y 
place is ; in general, the costs can be positive or negative, and there are no 

restrictions to how many different items can be put at the same place. Find yes/no 
answer to the following question: is there at least one total cost whose absolute value 
falls into an arbitrarily given interval. 

This problem is typical for optimal design. Since the cost of a particular 
distribution is expressed by the sum 

j = N = n" (6) 

M 

classically one has to compute all the n" sums (8) in order to find is there at least 
one E such that 

a^ < (7) 

where and r ?2 are arbitrarily prescribed positive numbers. 

(r 1 

Since costs can be positive or negative, the absolute value in Eq. (7) 

represents a global constraint, and therefore our problem belongs to the class of so 
called constraint satisfaction problems which are the hardest among other 
optimization problems. The constraint (7) prevents one from decomposing the 
solution into smaller-size sub-problems. As shown by Andre Stechert this problem 
can be mapped into the partition problem , and therefore, it is NP-complete. 

Now we will demonstrate how this problem can be solved by the quantum device 
described above in one computational step. 

First, let us represent the unitary matrix U q corresponding to the time-independent 
Hamiltonian 

Uq = ( 8 ) 

as a tensor product of n diagonal unitary matrices of the size n x n: 



(9) 
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where 



c',= 






0 



( 10 ) 



Then the unitary matrix Uq in (9) will be also diagonal 
and 



(Er 



Ho = 



0 



A 



0 ••• E 



, N = n 



( 11 ) 



N J 



while E j is expressed by Eq. (6). 

Hence, if one select A';’ in (8) as the costs of a item put in a place, 

then the eigenstates Ej of the Hamiltonian EIq will represent costs of all = n" 

possible distributions (8). 

Now we have to choose the perturbation of the Hamiltonian, (see Eq. (1)) Eor that 
purpose assume that initially the quantum device is in a certain base state k, whose 
energy Ej^ does not belong to the interval (7), i.e.. 



or \E^\> a 2 

and select H^and as follows: 



where 



H, = P 

fl 

p= ... 



( 12 ) 



(13) 



O 

ly 



(14) 



and 



^H = 



= Const if — < ft)< ^ 



30 

0 



n 

otherwise 



n 



(15) 
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Here, for the sake of concreteness, the initial state was selected such that: 



E^-a, > E^- 



(16) 



Suppose that the given interval (ij , (?2 contains at least one total cost \E^\^ from 
the set (6), i.e., \Ea satisfies the inequality (7). Then, according to Eqs. (5) and (14), 

the resonance transition from the initial state E^^ to the state E^ (or other states 

satisfying (7)) will occur with the probability one. Indeed, in the presence of a 
resonance, the probability for non-resonance transitions are vanishingly small if 
£q « 1 (seeEq. (1)). 

However, if the given interval (ij , (?2 does not contain any costs \e^ | from the set 

(6), then according to Eqs. (5) and (14), there will be no resonance transitions at all, 
and therefore, with the probability one the quantum device will stay in the initial 
state. 

Thus, in one computational step, the problem is solved in a deterministic way. As 
follows from Eq. (5), the time required for probability of the resonance transition 
from the state k to q to become close to one has the order: 



t* 
















(17) 



4. Conclusion 

Thus, it has been demonstrated how a “man-made” problems of exponential 
computational complexity which is hard to handle by algorithmic methods are solved 
by exploiting a strongly pronounced physical phenomena: quantum resonance. 

The main advantage of the proposed approach is in exponential speedup of 
solutions to NP-complete combinatorial problems. Two fundamental physical 
phenomena contribute to it: quantum resonance and tensor-product decomposability 
of the underlying unitary matrix. 

Quantum resonance allows one to represent all the possible solutions to the 
problem as a set of competing dynamical processes: energy exchanges between pairs 
of quantum eigenstates. The mathematical formalism of quantum resonance provides 
a storage for these processes: the transition matrix (see Eq. (2)) where each 

process is labeled through the corresponding transition probability. 

Tensor-product decomposability is a fundamental property of the Schrodinger 
equation for multi-particle systems. Due to its effect, the number of stored solutions, 
i.e., the number of transitions is exponentially larger than the number of the 
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input parameters (see Eq. (6)) and that is what directly contributes into exponential 
speedup and capacity. 

In order to make these two physical phenomena work together, one has to choose 
the Hamiltonian of the quantum system such that the optimal solution is the winner in 
the competition with other solutions, i.e., that its transition probability is the largest. 
This is achieved by selecting the oscillating part of the Hamiltonian in the form of 
(14). 

It should be emphasized that the solution of one NP-complete problem opens up a 
way to solve every NP-complete problem in polynomial time. 
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Abstract. We consider the relationship between computational com- 
plexity and the laws of physics, and show that nonlinear generalizations 
of quantum mechanics allow for polynomial-time solution of NP-complete 
and #P problems. Whether or not experiments ultimately reveal small 
nonlinearities in the evolntion of quantnm states (indeed, we believe that 
they most likely will not), these results underscore the close relationship 
between physics and computation. 



Not too long ago, it was generally believed that computational complexity 
was a concept independent of any particular computing model. According to 
the strong form of Church’s thesis, any reasonable computing device could be 
simulated by any other, using only polynomial resources. The degree to which 
this point of view was accepted is made evident by the degree of surprise which 
greeted the counterexample: the discovery that quantum computers could solve 
in polynomial time problems thought to be classically intractable, such as fac- 
toring large numbers. [Q 

Yet perhaps the failure of Church’s thesis in its strong form should not have 
been so surprising. After all, the idea is built upon elaborate and non-obvious 
proofs that one computing model can simulate another. And since a computer is 
in the end a physical device - whose behavior is governed by the laws of physics 
- it is only natural that the power of a machine should depend in large part 
upon the physics which underlies its operation. All the recent work in quantum 
computing and quantum information has now made this concept quite evident. 

In this paper, we explore the possibility of another physical model of compu- 
tation - one that relies upon nonlinear quantum mechanics. The idea that under 
some circumstances the superposition principle of quantum mechanics might be 
violated - that is, that the time evolution of quantum systems might be slightly 
nonlinear - has been previously suggested in various contexts Q| ^ Q |H] 101 - Such 
nonlinearity is purely hypothetical: all known experiments confirm the linearity 
of quantum mechanics to a high degree of accuracy Pj 0 |0j [11^ . Further, nonlinear 
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quantum theories have often been controversial and frequently have had major 
theoretical difficulties E2C3E1- Nevertheless, the implications of nonlinear 
quantum mechanics on the theory of computation are profound. In particular, 
we explain why one can exploit nonlinear time evolution so that the classes of 
problems NP and (including oracle problems) may be solved in polynomial 
time. Additional algorithms and further details, including an explicit demon- 
stration of how to construct nonlinear quantum logic gates using the Weinberg 
model, are available in m. 

Loosely defined, the class NP contains all problems for which it is possible to 
verify a potential solution in polynomial time. These include all problems in the 
class P (those that can be solved in polynomial time) as well as the NP-complete 
problems, e.g., the traveling salesman, satisfiability, and sub-graph isomorphism, 
for which no known polynomial time algorithms exist. One natural way to ap- 
proach these problems on a quantum computer is to create a superposition of 
every possible potential solution, and then try to determine if one of those po- 
tential solutions is in fact a true solution. In some sense, this technique nicely 
mimics the theoretical behavior of a non-deterministic Turing machine. In order 
to both simplify and generalize the result, it is convenient to replace the actual 
NP problem with an oracle problem, stated as follows: consider an oracle, or 
“black box”, which calculates a function that maps n bits into a single bit; i.e., 
it takes an input between 0 and 2"’ — 1 and returns either 0 or 1. We need to 
determine if there exists an input value x for which f(x) = 1. It is easy to see 
that a polynomial time algorithm to solve this problem can be used to solve all 
problems in the class NP. (Note, however, that the converse is not necessarily 
true - the NP complete problems contain structure, whereas the function defined 
above is completely arbitrary. Thus this oracle problem is in fact a harder prob- 
lem than those in NP, because it clearly requires exponential time on a classical 
Turing machine.) For simplicity we will restrict ourselves to the case where there 
is at most one value x for which f(x)=l. 



^ It may be objected that many if not all models of nonlinear quantum mechanics 
allow for superluminal communication - and thereby backward-in-time signaling, 
which would then allow in some peculiar sense for the solution of all problems in 
zero or even negative time. (Although they would still require an exponential number 
of calls to the oracle). From the authors’ perspective, this fact is simply evidence 
that nonlinear quantum mechanics is most likely wrong. Indeed, we feel that the 
results described here should probably also be viewed as evidence that quantum 
mechanics is linear, rather than as a blueprint for a potential device. However, we 
would like to point out in addition that the ” backward-in-time” computer relies 
upon a combination of relativity and nonlinear quantum mechanics, whereas the 
algorithms which we describe here and elsewhere do not. That is, they could run on 
a non-relativistic nonlinear quantum computer. In other words, these algorithms rely 
upon a fundamentally different aspect of nonlinear quantum mechanics. We do not 
know if it is possible to create a self-consistent - that is, causal - theory of nonlinear 
quantum mechanics. However, we do know this: in the end, only experiment can tell 
what are the correct physical laws. 
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One might attempt to solve this oracle problem on an ordinary quantum 
computer using the following technique. First, create a superposition of all the 
input states: 

^ = ( 1 ) 

Next, use the oracle to calculate f(i) for each |t) in parallel: 

V ^ i=0 

Although the final qubit in some sense “knows” the solution to the problem, a 
measurement of this qubit will yield |1) with either zero probability if there is no 
solution or an exponentially small probability if there is a solution. It is therefore 
necessary to enhance the amplitude of the |1) component of the superposition 
by an exponentially large factor, in order to distinguish the two cases. One idea 
is to try to increase the number of states with a |1) rather than increase the 
amplitude of the particular state |t) for which f(i) = 1. Imagine comparing the 
states |i) in pairs, according to the last bit of |i). Looking at just the last bit of 
|i) and the final qubit f(i), we see one of the following states: 



(a) |00) + |11) 

(b) |01) + |10) (3) 

(c) |00) + |10) 

The last case occurs most frequently. What we’d like to do is map these states 
into new states using the following transformation: 

(а) |00) + |11)-^|01) + |11) 

(б) |01) + |10)^|01) + |11) (4) 

(c) |00) + |10) — ^ |00) + |10) 

This transformation is like an AND gate - it ignores the first qubit and places 
the second qubit in the state |1) if and only if either of the original components 
had the state |1) for the second qubit. Performing this transformation on the 
superposition of all |t) will leave every state unaffected except the state which 
neighbors the solution |a;). This state will then pick up a |1) in place of the 
|0) which it originally had. If we then compare states in pairs according to the 
second bit of |i), the number of states with a |I) for the final qubit will double 
again. Repeated application of this process would then leave the final qubit 
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unentangled with the first n qubits: it would be either in the pure state |0) if 
there are no solutions to the problem, or the pure state |1) if there had existed 
some state \x) for which f(x)=l. A measurement of this qubit thereby reveals 
the answer to the problem. 

Of course, this transformation cannot be accomplished using an ordinary 
quantum computer, because it is nonlinear. That this is the case can be easily 
seen by the fact that in cases (a) and (c) the initial states are non-orthogonal, but 
the final states are orthogonal. Hence the desired transformation cannot possibly 
be linear. One is tempted to try to patch this problem in a variety of ways. One 
possibility is to imbed this transformation in a larger Hilbert space and hope 
that a projective subspace might reduce to the desired nonlinear transformation. 
Unfortunately, this approach cannot succeed. The reason is that different ele- 
ments of the superposition need to interfere with each other in later stages of the 
algorithm. If the “linearized” version of the transformation results in extraneous 
“garbage” qubits, these will prevent the states from interfering with each other 
in future iterations. Equivalently, one might hope that the non-unitary evolution 
associated with the measurement process might suffice to accomplish the neces- 
sary transformation, but this will of course fail for the same reason. One can 
also try to hide the extraneous information in the phases of the states. Although 
this appears promising at first, more careful analysis reveals essentially the same 
difficulties 0 

We see, therefore, that the potential application of a nonlinear quantum logic 
gate arises naturally from a fairly straightforward approach to the NP oracle 
problem. From an intuitive perspective, however, it is not exactly clear why it 
is that nonlinearity is important, beyond the fact that the gate which we desire 
for our algorithm does not happen to be linear. We can get a better feeling for 
this from a slightly different perspective. 

Consider the shortest-path version of the traveling salesman problem, and 
a classical algorithm that finds “pretty good” solutions, such as simulated an- 
nealing. Implement this algorithm on a quantum computer, and initialize the 
quantum computer in a state which as before is a superposition of all possible 
inputs. After the algorithm has finished, the result will be a quantum computer 
that exists in a superposition of all the various local minima that are found by 

^ One can also imagine approaching the problem with phases from the very beginning, 
thereby avoiding the need for the nonlinear gate. After calcnlating f(i), multiply 
the phase of the solution state by -1 and then reverse the computation of f(i). The 
computer would then be in an equal superposition of all |i>, with the state |x>for 
which f(x) = l having opposite phase. Pairs of states containing two |i>of opposite 
phase are orthogonal to those for which both |i>are of the same phase, so it is 
possible to reverse the phase of the pair, thereby transfering the minus sign from the 
solution |x>to its partner state. Repeating the process which created the phase in 
the first place would then leave two states with negative phase. By iterating through 
each bit of |i>(as in the previous algorithm), one can continue the process until a 
substantial fraction of the states have negative phase. This situation can be easily 
detected. Unfortunately, each iteration takes twice as long as the previous one, so 
the algorithm described in this footnote requires exponential time. 
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searching from every possible initial state. A measurement would then reveal any 
one of these local minima, but most likely not the shortest path. Thus, what one 
would like to do before the measurement is to compare the various states with 
each other and shift the amplitude into the states representing shorter paths. Put 
differently, we would like an algorithm which acts in a space that is restricted 
to only those quantum states which already have non-zero amplitude. Unfortu- 
nately, the linear transformations allowed by ordinary quantum mechanics have 
no way for a given state (or more precisely, component of a superposition) to 
“sense” the amplitude of other states. This is the aspect of nonlinear quantum 
mechanics which allows for the solution of NP-complete problems. 

Returning to the algorithm described above, it is clear that if one could 
obtain the necessary nonlinear transformation, one could find the answer to an 
NP-complete problem in polynomial (in fact, linear) time, and using only a single 
evaluation of the oracle. It may be objected that the nonlinear operator described 
above appears arbitrary and unnatural: indeed, it was selected exactly so as to 
be able to solve the stated problem. However, the apparently arbitrary opera- 
tion can be built using ordinary unitary operations and much simpler and more 
‘natural’ single qubit nonlinear operators (that is, to the extent that any nonlin- 
ear operation in quantum mechanics can be considered ‘natural’). One possible 
technique for generating the transformation would be to use the following steps: 
first, act on the two qubits with the unitary operator 
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10 0 1 
oil 0 
0 1-10 
10 0 -1 



This transforms the states above as follows 



( 5 ) 



(a)^[|00) + |ll)]^|00) 

(&)^[|01) + |10)]^|01) (6) 
(c) ^ [|00) + |10)] ^ i [|00) + |01) - |10) + 111)] 

Next, operate on the second qubit with a simple one qubit nonlinear gate n_ 
that maps both |0)and |l)to the state |0). Thus 



(а) ^[|00) + |ll)] 

(б) ^[|01) + |10)j 

(c)^[|00) + |10)j 



| 00 ) 

| 00 ) 

1 ^) 



( 7 ) 
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The third final state is unknown because we have not bothered to specify how 
the non-linear gate acts on the state |00>-|- |01>- |10>-|- |11>. This omission 
thereby allows for flexibility in choosing the gate n_ . Whatever the state | A>may 
be, we can perform a unitary operation that will transform the first qubit into 
the pure state |0)while leaving the state |00>in place. The computer is then in 
one of the following states 



(a) |0)|0) 

(b) |0)|0) (8) 

(c) |0)(x|0) + 2 /|l)) 

A second non-linear gate n+ is now required that will map the state x|0)-f 
y|l)to the state |1) (for the particular values of x and y which result from the 
above steps but not necessarily for arbitrary x and y), while leaving the state 
|0)unchanged. After this gate is applied, the transformation resulting from the 
steps described so far is then: 



(а) ^[|00) + |ll)] — |00) 

(б) ^[|01) + |10)] — |00) (9) 

(c)^[|00) + |10)] — |01) 

The desired two qubit transformation is then easily obtained with a NOT 
gate on the second qubit and a 7t/2 rotation on the first qubit. 

Having thus shown how to generate the needed two qubit gate, the question 
is now reduced to that of generating the simpler single qubit gates n_ and fi+ . If 
one considers the state of a qubit as a point on the unit sphere, then all unitary 
operations correspond to rotations of the sphere; and while such rotations can 
place two state vectors in any particular position on the sphere, they can never 
change the angle between two state vectors. A nonlinear transformation corre- 
sponds to a stretching of the sphere, which will in general modify this angle. The 
desired gates fi_ and are two particular examples of such operations. Except- 
ing perhaps certain pathological cases (e.g., discontinuous transformations), it is 
evident that virtually any nonlinear operator, when used repeatedly in combina- 
tion with ordinary unitary transformations (which can be used to place the two 
state vectors in an arbitrary position on the sphere), can be used to arbitrarily 
increase or decrease the angle between two states, as needed to generate the 
gates n_ and n+. We show explicitly in m how one can generate these gates 
using the Weinberg model. 

To conclude: we have seen that nonlinear quantum mechanics enables one 
to solve NP complete problems in polynomial time. In fact, it is not difficult to 
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generalize this algorithm to solve the oracle problem^, which in turn implies 
that a nonlinear quantum computer can solve in polynomial time all problems 
in the entire polynomial hierarchy. Nevertheless, the authors would like to em- 
phasize that these results are probably best viewed as further evidence that the 
universe is exactly linear - rather than as blueprints for the design of a machine 
if it were not. (Although it is certainly not obvious, a priori, that quantum me- 
chanics need be strictly linear - and the question can be fairly viewed as an 
experimental one). Regardless, the connection between physics and computa- 
tion is now made unavoidable: the underlying laws of physics strongly impact 
the theoretical complexity of computational problems. 
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® To solve the problems in the class ifP, one needs to determine the exact nnmber 
of solutions i for which f(i) = l, instead of just whether or not such solutions exist. 
This can be accomplished if one replaces the flag qnbit with a string of log 2 n qubits 
and modifies the algorithm slightly - so that it adds the number of solutions in each 
iteration rather than performing what is effectively a one bit AND. In this case, a 
measnrement of the final result reveals the exact number of solutions. 
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Abstract. A quantum computer can efficiently find the order of an el- 
ement in a group, factors of composite integers, discrete logarithms, sta- 
bilisers in Abelian groups, and hidden or unknown subgroups of Abelian 
groups. It is already known how to phrase the first four problems as the 
estimation of eigenvalues of certain unitary operators. Here we show how 
the solution to the more general Abelian hidden subgroup problem can 
also be described and analysed as such. We then point out how certain 
instances of these problems can be solved with only one control qubit, 
or flying gubits, instead of entire registers of control qubits. 



1 Introduction 

Shor’s approach to factoring |SE|, (by finding the order of elements in the multi- 
plicative group of integers mod N, referred to as Z^) is to extract the period in a 
superposition by applying a Fourier transform. Another approach, based on Ki- 
taev’s technique |E|, is to estimate an eigenvalue of a certain unitary operator. 
The difference between the two analyses is that the first one considers (or even 
’measures’ or ’observes’) the target or output register in the standard computa- 
tional basis, while the analysis we detail here considers the target register in a 
basis containing eigenvectors of unitary operators related to the function /. The 
actual network of quantum gates, as highlighted in , is the same for both 

algorithms; it is helpful to understand both approaches. In some cases, which we 
discuss in Sect. 0 this approach suggests implementations which do not require 
a register of control qubits. A more general formulation of the order- finding 
problem as well as the discrete logarithm problem, and the Abelian stabiliser 
problem, is the hidden subgroup problem (or the unknown subgroup problem 
IIH0I ) . In the case that G is presented as the product of a finite number of cyclic 
groups (so G is finitely generated and Abelian), all of these problems are solved 
by the familiar sequence of a Fourier transform, a function application, and an 
inverse Fourier transform. In this paper we describe how this more general prob- 
lem can also be viewed and analysed as an estimation of eigenvalues of unitary 
operators. 



C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 174-E2S1 1999. 
© Springer- Verlag Berlin Heidelberg 1999 



The Hidden Subgroup Problem and Eigenvalue Estimation 



175 



2 The Hidden Subgroup Problem 

Let / be a function from a finitely generated group G to a finite set X such that 
/ is constant on the cosets of a subgroup K (of finite index, since X is finite), 
and distinct on each coset. The hidden subgroup problem is to find K (that is, a 
generating set for K), given a way of computing /. When K is normal in G, we 
could in fact decompose f as ho where 5 is a homomorphism from G to some 
finite group iJ, and h is some 1-to-l mapping from H to the set X. In this case, K 
corresponds to the kernel of g and H is isomorphic to GjK. We will occasionally 
refer to this decomposition, which we illustrate in Fig. Q1 Define the input size, 



f =h-g 

G s 




Fig. 1. The function / can be viewed as the composition of a homomorphism 
g to a group H , and some 1-to-l mapping h to the set X. Our hidden subgroup 
K will be the kernel of g, and H is isomorphic to GjK. 



n, to be of order log 2 [G : K], We will count the number of operations, or the 
running time, in terms of n. An algorithm is considered ejficient if its running 
time is polynomial in the input size. By elementary quantum operations, we are 
referring to a finite set of quantum logic gates which allow us to approximate any 
unitary operation. See [BBCDMSSSWj for a discussion and further references. 
Our running times will always refer to expected running times, unless explicitly 
stated otherwise. By expected running time we are referring to the expected 
number of operations for any input (and not just an average of the expected 
running times over all inputs). 

We should be clear about what it means to have a finitely generated group 
G, and to be able to compute the function /. This is difficult without losing 
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some generality or being dry and technical, or both. The algorithms we describe 
only apply for groups G which are represented as finite tuples of integers cor- 
responding to the direct product of cyclic groups (consequently, G is finitely 
generated and Abelian). Conversely, for any finitely generated Abelian G, there 
is a temptation to point out that G is isomorphic to such a direct product of 
cyclic groups, and assume that we can easily access this product structure. This 
is not always the case, even in cases of practical interest. For example, the 
multiplicative group of integers modulo N for some large integer N , which is 
Abelian of order (the Euler (^function) and thus isomorphic to a product 

of cyclic groups of prime power order. We will not necessarily know or 

have a factorisation of it along with a set of generators for However, in light 
of the quantum algorithms described in this paper, we could efficiently find such 
an isomorphism, thereby increasing the number of finitely generated Abelian 
groups which can be efficiently expressed in a manner which allows us to employ 
these algorithms. We will however leave further discussion of these details to 
another note |EMj . When we talk about computing /, we assume that we have 
some unitary operation Uj which takes us from state | x) | 0) to | x) | /(x)). It 
could, for example, take | x) | y) to | x) | y -f /(x)), where -I- denotes an appro- 
priate group operation, such as addition modulo N when the second register is 
used to represent the integers modulo N . 

Various cases of the hidden subgroup problem are described in IHII, IHi, EH> 
{HLj, inil, {El, (DEMM), and IHol . We note that m also covers the case that 
/ is not necessarily distinct on each coset (that is, h is not 1-to-l), and this is 
discussed in the appendix. Finding the order r of an element in a group H of 
unknown size, or the period r of a function /, is a special case where G = Z and 
K = rZi. For any generator ej of a finitely generated G, we can use the algorithm 
in Sect. l4.'/!l to find an integer k such that /(fcej) = /(O), so that fcej G K. We 
find this k with 0(n) applications of / and O(n^) other elementary quantum 
operations. We can then assume that ej is of order k (that is, factor (fcej) out 
of G), and in general assume that G is a finite group. 

We give a few examples. 

Deutsch’s Problem: Consider a function / mapping Z 2 = {0, 1} to {0, 1}. 
Then f{x) — f{y) if and only x — y G K, where where K is either {0} or 
Z 2 = {0, 1}. If K is {0}, then / is I — to — I (or balanced), and if K is Z 2 then 
/ is constant. iraiCKMVf] 

Simon’s Problem: Consider a function / from Z 2 * to some set X with the 
property that f{x) = f(y) if and only ii x — y G {0,s} for some string s of 
length 1. Here K = {0,s} is the hidden subgroup of Z 2 ^ Simon presents 
an efficient algorithm for solving this problem, and the solution to the hidden 
subgroup problem in the Abelian case is a generalisation. 

Discrete Logarithms: Let G be the group Z,. x Z^ where Z,. is the additive 
group of integers modulo r. Let the set X be the subgroup generated by some 
element a of a group H , with oT = 1. For example, H = F*, the multiplicative 
group of the field of order q, where r = q — 1. Let a,b G G, and suppose 
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b = o’”. Define / to map (x,y) to Here the hidden subgroup of G is itT = 
{{k, —km)\k = 0 , 1 , . . . , r — 1 } = (( 1 , —m)), the subgroup generated by ( 1 , —m). 
Finding this hidden subgroup will give us the logarithm of b to the base a. The 
security of the U.S. Digital Signature Algorithm is based on the computational 
difficulty of this problem in F* (see Mm\ for details and references). Here 
the input size is n = [log2 r~\ . Shor’s algorithm IShI was the first to solve this 
problem efficiently. In this case, / is also a homomorphism which can make 
implementations more simple as described in Sect .|3 

Self-Shift-Equivalent Polynomials: Given a polynomial P in I variables 
Xi, X2, ...,Xi over F^, the function / which maps (oi, 02, . . . , a;) G F^ to 
P(Ai — oi, A2 — 02 , . . . ,X[ — ai) is constant on cosets of a subgroup AT of F^. This 
subgroup K is the set of self-shift-equivalences of the polynomial P. Grigoriev 
m shows how to compute this subgroup. He also shows, in the case that q has 
characteristic 2 , how to decide if two polynomials P± and P2 are shift-equivalent, 
and to generate the set of elements (oi, 02, ■ ■ ■ ,ai) such that Pi(A'i — 01,^2 — 
G2, ■ ■ ■ , Xi — ai) = P2{Xi, X2, ■ . ■ ,Xi). The input size n is at most I log2 q. 

Abelian Stabiliser Problem: Let G be any group acting on a finite set 
X. That is, each element of G acts as a map from X to X, in such a way that 
for any two elements a, 6 G G, a(b(x)) = (ab)(x) for all x G X. For a particular 
element x of X, the set of elements which fix x (that is, the elements a G G such 
that a(x) = x), form a subgroup. This subgroup is called the stabiliser of x in 
G, denoted Sto{x). Let denote the function from G to A which maps g G G 
to g{x). The hidden subgroup corresponding to fx is K = Stc{x). The finitely 
generated Abelian case of this problem was solved by Kitaev m, and includes 
finding orders and discrete logarithms as special cases. 

3 Phase Estimation and the Quantum Fourier Transform 

In this section, we review the relationship between phase estimation and the 
quantum Fourier transform which was highlighted in j( )H;ivi IVij . 

The quantum Fourier transform for the cyclic group of order N, F^, maps 




N-l 



So Fj^j^ maps 




AT-l 



More generally, for any (j), 0 < (j) < 1, maps 




N-l 



N-l 



( 1 ) 
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where the amplitudes are concentrated near values of x such that x/N are 
good estimates of 4>. The closest estimate of 4> will have amplitude at least 4 /tt^. 
The probability that x/N will be within k/N of 4> is at least 1 — l/(2fc — 1). See 
| |( T; M M I for details in the case that N is a, power of 2; the same proof works for 
any N. Thus to estimate (p such that, with probability at least 1 — e, the error 
is less than 1/M, we should use a control register containing values from 0 to 
— 1 and apply for any N > M(l/e+l)/2. For example, if we desire an 
error of at most 1/2" with probability at least 1 — 1/2™ we could use N = 2"+™. 
In practice, it will be best to use the N that corresponds to the group that is 
easiest to represent and work with in the particular physical realisation of the 
quantum computer at hand. We expect that this N will be a power of two. 

For convenience, we will omit normalising factors in the remainder of this 
paper. It will also be convenient to have a compact notation for the state on the 
right hand side of du which we consider to be a good estimator for \(j)). So let 
us refer to this state as | p) ^ or just | cp) if the value of N is understood. Lastly, 
we will use exp(a;) to denote e“. 

4 The Algorithm 

To restrict attention from finitely generated groups G to finite groups we need 
to know how to solve the cyclic case (just one generator), that is, to find the 
period of a function from Z to the set X. We will first describe how to find the 
order of an element a in a group H, or equivalently, the period of the function 
/ : t ^ a*, as Shor |Sh] did for the group H = Z^, the multiplicative group of 
integers modulo N . We will then show how to generalise it to find the period of 
any function f : Z X. li f were a homomorphism (so h is an isomorphism of 
H, when / is decomposed as f = ho g), we would just be finding the order of 
/(I) in H . The difference is that we are showing how to deal with a non-trivial 
h which hides the homomorphism structure. The details will also help explain 
how to find hidden subgroups of finite Abelian groups. 

4.1 Finding Orders in Groups 

We have an element a from a group F[ and we wish to find the smallest positive 
integer r such that o’" = 1. The group H is not necessarily Abelian; all that 
matters is that the subgroup generated by a is Abelian, and this is always true. 
The idea is to create an operator Ua which corresponds to multiplication by a 
(so it maps | ?/) to | ay)). Since o’" = 1, then U// = I, the identity operator. Hence 
the eigenvalues of Ua are rth roots of unity, exp{2nik/r), fc = 0, 1, . . . , r — 1. By 
estimating a random eigenvalue of Ua, with accuracy l/2r^, we can determine 
the fraction k/r. The denominator (with the fraction in lowest terms) will be a 
factor of r. We thus seek to estimate an eigenvalue of Ua', note that Ua = Ua^. 

For any integer x define t/a* to be the operator that maps | y) to \a^y). 
Define U^x to be the operator which maps | x) | y) to | x) Ua^ \y) = | a:) | a^^y). 
Note that U^x acts on two registers and x is a variable which takes on the value 
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in the first register, while C/a* acts on one register and x is fixed. Consider the 
eigenvectors 



1 

I ^k) = ^exp(-27riA:i/r) | a*) , A: = 0, 1, . . . , r - 1, (2) 

of Ua^ and respective eigenvalues exp(27rifca;/r) . If we start with the superposi- 
tion 

2'-l 

I I 

x—0 

and then apply C/^x we get 

2'-l 

exp(2Trikx /r) \ x) \ Wk) ■ 

X—0 

As discussed in the previous section, applying F~,^ to the first register gives 

I k/r) I Fk) and thus a good estimate of k/r. 

We will not typically have \Fk) but we do know that 1 1) = 

Therefore we can start with 



\0)\1) = \0)Y^\F,) = J2\0)\F,) (3) 

k—0 k—0 

and then apply F 21 to the first register to produce 

r-l /2'-l \ 

E 1'^'^)- (4) 

k—0 ytc— 0 J 

We then apply C/^x to get 

r-l /2'-l \ 

EE exp{2TTikx/r) \ x) j \ Fk) (5) 

yai— 0 J 

followed by F~i^ on the control register to yield 

r— 1 

E I I 

fc=0 

Observing the first register will give an estimate of k/r for an integer k chosen 
uniformly at random from the set {0,1,. ..,r — 1}. As shown in jHE], we choose 

1 > 2 log 2 r, and use the continued fractions algorithm to find the fraction k/r. Of 
course, we do not know r, so we must either use an I we know will be larger than 

2 log 2 r, such as 2 log 2 N in the case that FI is . (Alternatively, we could guess 
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a lower bound for r, and if the algorithm fails, subsequently double the guess 
and repeat.) We then repeat 0(1) times to find r. This algorithm thus uses 0(1) 
exponentiations, or 0(n) group multiplications, and 0{'n?) elementary quantum 
operations to do the Fourier transforms. 

We can factor the integer N by finding orders of elements in This uses 
only 0{n^) or exp(clogn) elementary quantum operations, for c = 3 + o(l) 
(or c = 2 + o(l) if we use fast Fourier transform techniques). Other deter- 
ministic factoring methods will factor N in or exp(cn) steps, where 

c = 1/2-1- o(l). The best known rigorous probabilistic classical algorithm (using 
index calculus methods) m uses exp(c(nlogn)^/^) elementary classical opera- 
tions, c = 1 -|- o(l). There is also an algorithm with a heuristic expected running 
time of exp(c(n^/^(log n)^/^) elementary classical operations (see |M( )V) for an 
overview and references) for c = 1.902 -|- o(l). Thus, in terms of elementary 
operations, a quantum computer provides a drastic improvement over known 
classical methods to factor integers. 



4.2 Finding the Period of a Function 

The above algorithm, as pointed out in m, can be applied to a more general 
setting. Replace the mapping from t to a* with any function / from the integers 
to some finite set X. Define tf/(x) to be an operator that maps f{y) to f{y + x). 
This is a generalisation of Ua^ except it does not matter how it is defined on 
values not in the range of /, as long as it is unitary. Define U f(x) to be an 
operator which maps | x) \ f{y)) to | x)Uf(^) \ f{y)) = \x)\f{y + x)). 

The following are eigenvectors of U 

r—1 

I ^ exp(-27rifct/r) | f{t )) , fc = 0, 1, . . . , r - 1, (7) 

with respective eigenvalues exp(27rz/px/r). As in (|3I), we can start with 

r—1 

|0)|/(0)) = 5]|0)|tf^,) 

k=0 

except with our new, more general, definition of | ^k)- We apply F 2 n to the first 
register to produce and then apply C7/(x) to produce (0, followed by 
to get ( 0 . Observing the first register will give an estimate of k/r for an integer 
k chosen uniformly at random, and the same analysis as in the previous section 
applies to find r. 

One important issue is how to compute U f(^x) only knowing how to compute 
/. Note that from (0 to 0 (using the modified definition of \^k)) we simply 
go from 



2"-l 2"^-! /r-l 

^ |x)|/(0))= ^ ^|x)|^,) 

tc— 0 x—0 \k—0 



(8) 
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to 

2"-l 2"-l /r-1 \ 

^ I x) I f{x)) = I XI exp(27rix/c/r) | <I'k) j (9) 

a :— 0 X — 0 \ fc — 0 / 

which could be accomplished by applying Uf, which we do have, to the starting 
state 

2”-l 

E i^)io)- 

x—O 

Thus even if we do not know how to explicitly compute the operators 
any operator Uf which computes the function / will give us the state Q. This 
state permits us to estimate an eigenvalue of tf/(x) which lets us find the period 
of the function / with just 0(1) applications of the operator Uf and O(n^) 
other elementary operations. The equality in (01 is the key to the equivalence 
between the two approaches to these quantum algorithms. On the left hand side 
is the original approach (IS], IBS, which considers the target register in 

the standard computational basis. We can analyse the Fourier transform of the 
preimages of these basis states, which is less easy when the Fourier transforms 
do not exactly correspond to the group G. On the right hand side of we 
consider the target register in a basis containing the eigenvectors of the unitary 
operators which we apply to it (as done in |Kij and jOFM M| . for example), and 
this gives us (0) , from which it is easy to see and analyse the effect of the inverse 
Fourier transform even when it does not perfectly match the size of G. 

4.3 Finding Hidden Subgroups 

As discussed in Sect. 0 any finite Abelian group G is the product of cyclic groups. 
In light of the order- finding algorithm, which also permits us to factor, we can 
assume that the group G is represented as a product of cyclic groups of prime 
power order. Further, for any product of two groups Gp and Gq whose orders 
are coprime, any subgroup K of Gp x Gq must be equal to Kp x Kq from some 
subgroups Kp and Kq of Gp and Gq respectively. We can therefore consider our 
function / separately on Gp and Gq and determine Kp and Kq separately. Thus 
we can further restrict ourselves to groups G of prime power order. This not 
only simplifies any analysis, it could reduce the size of quantum control registers 
necessary in any implementation of these algorithms. 

Let us thus assume that G — Z^mi x x • • • x Z^m, for some prime p and 
positive integers toi < m 2 < ■ ■ ■ < mi = m. Let K = {k= (/ci,/c 2 ,...,fc;)|/(x) = 
/(x + k) for all x G G}. The ’promise’ is that / is constant on cosets of K, 
and distinct on each coset. In practice, this will usually be a consequence of the 
nature of /, as in the case of discrete logarithms where f{xi,X 2 ) = or 

whenever / is constructed as h o g for some homomorphism g from G to some 
finite group iJ, and a 1-to-l mapping h from H to the set X. 

Let [// be an operator which maps | x) | 0) to | x) | /(x)). Define ei = (1, 0, . . . , 
0), G 2 = (0,1,0,..., 0), and so on. Let us also consider an operator related to Uf, 
t^/(xej), which maps | x) \ /(y)) to | x)Uf(^^^.) \ /(y)) = \x)\f{y + xej)). In the 
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case of Simon’s Problem, t//(x(o,i,o)) maps | 1) | /(yi, ?/2, J/s)) to | 1) C//(o,i.o) l/(2/i> 
2/2, 2/3))) = I 1) I/(2 /i, 2/2 + 1,2/3)) and does nothing to | 0) | /(yi, j/2, 2/3))- 
For each t = ■ ■ ■ ,t[), 0 < tj < satisfying 

i 

= 0 modp™ for all he a: (10) 

i=i 

define 

I ^t) = E exp I ^ I I /(a)) . (11) 

a&G/K \ ^ 3 = 1 ) 

We are summing over a set of representatives of the cosets of K modulo G, 
and by condition dinj on t, this sum is well-defined. Let T denote the set of t 
satisfying m, which corresponds to the group of characters of GjK. The | Ify) 
are eigenvectors of each t//(a,ej), with respective eigenvalues exp(27ria;fy/p”’'^ ) . 
By determining these eigenvalues, for j = 1, 2, . . . , Z, we will determine t. If we 
had I 'Ft) in an auxiliary register, we could estimate tj/p'^^ using Uf(^xej) by the 
technique of the previous section. If we use F~m^ we would determine tj exactly, 
or we could use the simpler for some k > log2(p"*-’), and obtain tj with 

high probability. For simplicity, we will use F~mj . In practice we could use F~^^ 
for a large enough k so that the probability of error is sufficiently small. 

By estimating tj /p™'^ for j = 1,2,..., I, we determine t. The algorithm starts 
by preparing I control registers in the state | 0) and one target or auxiliary register 
in the state | ’ft), applies the appropriate Fourier transforms to produce 

E 1 ^ 1 ) ••• E 1^^) pt) (12) 

V 3 , 1=0 / V Xi =0 ) 

followed by Uf(^xej) for j = 1, 2, . . . , n, using the jth register as the control and 
I 'Ft) as the target, to produce 

f E I a^i)) ■ • • f E I a^i)) I '^t) • (13) 

\ 3 , 1=0 ^ /V a,i =0 ^ / 

Then apply F~J^ to the jth control register for each j to yield 

\h)\h)...\U)\'Ft) (14) 



from which we can extract t. As in the previous section, we do not know how to 
construct | <ft), but we do know that 

i/(o)) = Ei'^t). 

t£T 



|0)|0)...|0) |/(0)) = El0)|0)--'|0)l'^t) 

tGT 



So we start with 
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apply Fourier transforms to get 

XI X X |2^')]l^t) (15) 

t^T \ tci— 0 / \ a:/— 0 / 

then apply U /(xej) using the jth register as a control register, for j = 1, 2, . . . , n, 
and the last register as the target register to produce 

X f X I • • • f X I I '^t) . (16) 

tST V xi=0 ^ / V rci=0 ^ / 

We finally apply to the jth control register for j = 1, 2, . . . , Z, to produce 

Xii)i'^*)- (1^) 

tGT 

Observing the first register lets us sample the t’s uniformly at random, and thus 
with 0{n) repetitions we will, by (IIOII . have enough independent linear relations 
for us to determine a generating set for K . For example, in the case of Simon’s 
problem, the 1 1) all satisfy t-s = mod 2 = 0 mod 2, where K = {0, s}. 

We could also guarantee that each new non-zero element of T will increase the 
span by a technique discussed in the appendix. 

This analysis of eigenvectors and eigenvalues is based on the work in m- 
The problem is that, unlike in IKIl, we do not always have the operator f7/(xej)- 
However, note that, like in Sect. 14.21 going from (I I bll to (I I till maps 



X 1^) 1/(0)) 

\0<Xj<p'^3 J 

to 

X i^)i/(^)) 

0<Xj<p^3 

= xf X e^P(2’^*^)l^i)) f X exp(2^i^)|a:i)Wt). 

tGT V xi=0 ^ / \ xi=0 ^ / 

We can create state l|TO by applying Uf, which we do have, to the starting 
state 

X l^)|0) 

O^Xi Kp'^i 

and proceeding with the remainder of the algorithm. As in Sect. 14.21 we are con- 
sidering the target register in the basis containing the eigenvectors | Fk) instead 
of the computational basis. 
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5 Reducing the Size of Control Registers 

5.1 Discrete Logarithms 

In practice, it might be advantageous to reduce the number of qubits required 
to solve a problem, or the length of time each qubit must be isolated from 
the environment. For example, suppose we wish to find m such that o’” = b, 
where the order of a divides r. The operators Ua^ and Ub<^, which correspond to 
multiplication by and respectively, share the eigenvectors \^k) (see ()2J) 
and have corresponding eigenvalues exp(2Trikx / r) and exp(2Trikmx / r) . We can 
assume we know r by applying the order-finding algorithm if necessary. By using 
Ua X with one control register we can approximate k j and by using U with 
another control register we can approximate (km mod r)/r and then extract 
m modulo r/gcd(r, fc). Note that since we know r, we only need logr bits of 
precision when estimating k/r and (km mod r)/r, instead of 21 og 2 r when using 
continued fractions. Note further that, knowing r, it may be possible to actually 
place I into the target register (by direct construction or otherwise) for some 
known k, and thus only require one control register with over log 2 r qubits to 
estimate (km mod r)/r. One way of doing this is to keep the target register 
after we have applied the order-finding algorithm and observed an estimate of 
k/r in the control register. At this point, the target register is almost entirely 
in the state | and we could now just estimate the eigenvalue of C/f,® on this 
eigenstate, which we know will be (km mod r)/r. 

5.2 One Control Bit 

Consider the case that we have an efficient computational means of mapping 
I /(y)) to I /(y -I- x)) for any x. If we consider / to be of the form h o g for a 
homomorphism g, we are requiring that h is the identity or some other function 
with enough structure that we can efficiently map h(g(y)) to h(g(y -I- x)) = 
h(g(y) + g(x)). In this case we can efficiently solve the hidden subgroup problem 
with only one control bit or a sequence of flying qubits ITHI.MkI . We illustrate 
this method for the problem of finding the order of an element a in a group H . 

Figure |2| shows the relationship between Fff and the controlled multipli- 
cations by powers of a in the order-finding algorithm. As already pointed out 
in mH], the measurements could be performed before the controlled rotations. 
The quantum controlled rotations could then be replaced with semi-classically 
controlled rotations of the subsequent qubits. This brings us to Fig. El where 
we observe further that all the operations on the first qubit could be performed 
before we even prepare the second qubit. All the operations could be done se- 
quentially, starting from the first qubit, the results of measuring the previous 
qubits determining how to prepare the next qubit before measurement. This 
means we could in fact do all the quantum controlled multiplications with a 
single control qubit provided we can execute the semi-classical controls which 
allow us to reset a qubit to | 0) -I- 1 1) and perform a rotation dependent upon 
the previous measurements (the rotations could in fact be implemented at any 
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|o)+|i) 

| 0 )^| 1 ) 




Fig. 2. We start with (| 0) + | 1 ))(| 0) + 1 1 ))(| 0) + 1 1 )) 1 1'k) = ELo I I ^fe)- 
The controlled multiplications create the state Ea:=o I I 

remaining gates create the state | fc/r) (apart from reversing the order of the 
qubits) which we then observe. The i?-gates correspond to Hadamard trans- 
forms, and the i?j-gates correspond to a controlled phase shift of exp( 27 ri/ 2 -’) 
on state | 1). 




Fig. 3. Here we employ a semi-classical version of ^23^. We could measure 
each qubit before it is used as a control, perform the controlled rotations 
semi-classically, and the probability of observing each possible output state 
I xi) I X2) I X3) is the same as in Fig. El 
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time after resetting the qubit and before applying the final Hadamard transform 
and measuring it; they could also be omitted provided we repeat each step a few 
extra times and do some additional classical post-processing as done in m)- 
Alternatively, the control qubits could be a sequence of flying qubits which are 
measured (or prepared) in a way dependent upon the outcomes of the previous 
measurements of control qubits. 

For the more general hidden subgroup problem in Abelian groups we would 
have a sequence of applications of C//(xej) controlled by one qubit, which is 
measured, then reset to a superposition of | 0) and 1 1) plus some rotation that 
is dependent upon the previous measurements. In summary. 

Remark 1. The hidden subgroup iF of a finitely generated Abelian group G 
generated by ei, 62 , . . . Ok, corresponding to a function / from G to a finite set 
X, can be found with probability close to 1 by semi-classical methods with only 
one control bit (or a sequence of flying qubits) and polynomial in n applications 
of the operators | x) \ /(y)) ^ | x) | /(y -I- xej)) for j = 1, 2, . . . , fc, where n is the 
index oi K in G. 
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Appendix: When / Is Many-to-1 on GJK 

The question of what happens when / is many-to-1 on cosets of K was first 
addressed in IBS' This is a slight weakening of the promise that / is distinct 
on each coset. Suppose / can have up to m cosets going to the same output, 
for some known m. That is, f = h o g where g is a homomorphism from G to a 
some group H with kernel K, and h is a mapping from H to X that is at most 
TO-to-1. If m divides the order of K, we clearly have a problem. For example, 
suppose K is the cyclic group of order 2M , and m = 2, but by changing one 
value of / it would have period M. It can easily be shown that (that 

is, at least c'/M for some positive constant c) applications of / are necessary 
to distinguish such a modified / from the original one with probability greater 
than 3/4, and thus no polynomial time algorithm, quantum or classical, could 
distinguish the two cases. Thus one requirement for there to exist an efficient 
solution in the worst case is that m is less than the smallest prime factor of \K\, 
the number of elements in K. 

The problem when / is not 1-to-l is the following. Running the same quantum 
algorithm will produce the state 

k^O 

where 

r— 1 

I = E exp(-27rfM/r) | f{t)) . 

This is the same definition as in dzj except now the | f{t)) are not necessarily 
distinct. This means the sizes of each of the | iF/) are not necessarily the same 
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since both destructive and constructive interference can occur. Also, the | 
are no longer orthogonal, and thus some constructive interference could occur 
on the poor estimates oik/r. Recall that even the close estimates oik/r will not 
yield useful results when k = 0. Any other k will at least reveal a small factor 
of r. So we need to guarantee that the probability of observing a close enough 
estimate of k/r for some k ^ 0 is significant. 

By making our estimates precise enough, say by using over 21og2r + e/m? 
control qubits, the estimates of k/r will have error less than l/2r^ (so that 
continued fractions will work) with probability at least 1 — e/m^. Thus assuming 
/ is 1-to-l, the probability of observing a bad output other than 0 would be at 
most e/m?, and the probability of observing 0 would be at most 1/r + e/w?. 
However, since / is at most m-to-1, these probabilities could amplify by at most 
a factor of m? to e and m? /r + e respectively. Observing a 0 means we either 
got a bad output, or the period of / is 1. Getting 0 as a bad output is not very 
harmful, however getting another bad output is more complicated, since it will 
give us a false factor of r. It will be useful to make e small, so that it is unlikely 
our answer is tainted by false factors of r. Once we have one factor r\ of r, 
we can replace f{x) with f{rix) (as done in [BLj 1 . which has period r/ri and 
find a factor of r/ri. Once we have a big enough factor r' of r, we might start 
observing O’s, which tells us that the remaining factor of the original r, namely 
r/r', is less than m^. Thus we can explicitly test f{r'),f{2r'),f{3r'), . . . , until 
we find the period, which will occur after at most applications. We thus have 
an algorithm with running time, in terms of elementary quantum operations and 
applications of /, polynomial in log(T’) and quadratic in m. 

The trick of reducing the order of the function can be applied to reduce the 
size of the group and hidden subgroup in the finite Abelian hidden subgroup 
problem. When G — Zp, we can efficiently test if K = G or K = {!}. The above 
analysis tells us how to deal with the case that G = Z^i for n > 1. A similar 
technique will reduce G — Zpi^ x • • • Z^i,. to a quotient group G and we can again 
proceed inductively until the size of G is less than . We can then exhaustively 
test G for the hidden subgroup K in another 0{m?) steps. 

We emphasize that this is a worst-case analysis. If there were a noticeable 
difference in the behaviour of a 1-to-l and an m-to-1 function /, m > 1, we 
could decide if a given function h is 1-to-l or many-to-one (by composing h 
with a function / whose period or hidden Abelian subgroup we know, and test 
for this difference in behaviour). Distinguishing 1-to-l functions from many-to-1 
functions seems like a very difficult task in general, and would solve the graph 
automorphism problem, for example. 
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Abstract. In the diakoptic approach, mechanisms are divided into sim- 
pler parts interconnected in some standard way (say by a “mechanical 
Connection”). We explore the possibility of applying this approach to 
quantum mechanisms; the specialties of the quantum domain seem to 
yield a richer result. First parts are made independent of each other by 
assuming that Connections are removed. The overall state would thus 
become a superposition of tensor products of the eigenstates of the inde- 
pendent parts. Connections are restored by projecting off all the tensor 
products which violate them. This would be performed by particle statis- 
tics, under a special interpretation thereof. The NP-complete problem of 
testing the satisfiability of a Boolean network is approached in this way. 

The diakoptic approach appears to be potentially able of taming the 
quantum whole without clipping its richness. 

BAGS: 89.70.-bc, 89.80.-kh. 

1 Definition of Qnantnm Mechanical Connection 

In (classical) applied mechanics, the diakoptic (dissectionistic) approach is ex- 
emplified by the notion of mechanical Connection. Connections divide the whole 
into simpler parts and reconstruct it — they introduce a “divide and conquer” 
strategy. In fig. 1(a), a crank-shaft is the Connection which imposes an invertible 
function between the positions of parts r and s (here discretized as 0 and 1, then 
the function is the Boolean NOT)0 

Things can be more difficult in quantum mechanics, since the Hamiltonian 
of a Connection may not commute with the parts Hamiltonians. This difficulty 
is avoided by implementing each Connection through a form of constructive and 
destructive interference, assumedly related to particle statistics. By applying 
reverse engineering, the Connection is first introduced as a mathematical fea- 
ture that would be nice-to-have in quantum mechanisms. Then we ask ourselves 
whether that feature can be physical. 

Let us consider the mechanism of fig. 1 from a quantum perspective. The 
Connection should establish a constraint between two otherwise independent 
quantum parts r and s, with eigenstates respectively |0)^, |1)^ and |0)j,, |1)^ (fig. 
lb). Their overall state should have the form 

^ The term “diakoptic” is borrowed from Gabriel Kron, who has developed a powerful 
formalization of this engineering methodologybl . 
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W) = a |0)^ |1)^ + /3 |l)r |0)^ > with \a\^ + |/3|^ = 1; 

the eigenvalues of each tensor product satisfy the Boolean NOT (the constraint) 
and \tf) is free to “move” in the two-dimensional Hilbert space Hg = 
span{\Q)^ |1)^ , |1)^ |0)g} ~ which gives the one degree of freedom required from 
a Connection. 




overall eyinmetij’ = connection 



(a) 

FIGURE 1. 



(b) 



Let us assume the Connection to be temporarily removed. The generic states 
of the two independent parts are: |0)^ -I- (3r |l)j. , |!?')s = oig |0)g -I- 

(3s |l)s. The whole unentangled state in the qubits Hilbert space is |'f') = 
«o |0)^ |0)^ -I- ai |0)^ |1)^ -I- «2 |l)r |0)s + «3 |l)r |l)s : with = ttrCTs, etc. The 
Connection is restored by projecting |!f') on the “symmetric” subspace Hg. Let 
us define the projector (or “symmetry”) Arg by: 



A. |0), |1), = |0), |1), , |1), |0), = |1), |0), , 

The Ars projection of |'f') is the normalized vector of Hg closest to it. This is 
obtained (in a peculiar way whose motivation will be clarified) by submitting a 
free normalized vector \(p) of (whose amplitudes on the basis vectors of 
are free and independent variables up to normalization) to the mathematically 
simultaneous conditions: (i) Arg \ip) = |v?) , and (ii) the distance between the 
vector before projection fP) and that after projection \(p) should be minimum; 
in equivalent terms ||(lf" |t/?)|| should be maximum. This yields the usual result 
\<p) = k («i |0)^ |l)g -I- 02 |1)^ |0)g) , an allowed Connection state {k is the renor- 
malization factor). The Connection will perform by operating on the parts under 
continuous Ars projection of the whole on Hg. 
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2 A Diakoptic Interpretation of Particle Statistics 

To give an introductory example, let us show a sort of Connection simply related 
to particle statistics. Let 1 and 2 be two free, identical and non-interacting spin 
1/2 particles. At a given time, their overall spatial wave function is obtained by 
symmetrizing/ antisymmetrizing (under particles permutation P12) the product 
of the two independent wave functions: 

^{xiX2) ^ {1 ±Pi2)'Pa{xi)'Pb{x2) = 

xi and X2 are the particles spatial coordinates; the -I- (— ) sign goes with the spin sin- 
glet (triplet) state (normalization is disregarded) . As readily seen: \\<P (xi,X2)||^ = 
cos^ kx for the singlet state, W'P (xi_a;2)||^ = s\v? kx for the triplet state, where 
X = xi — X2i k = kA — ks- Thus close (separated) particles are more likely to 
be found in a singlet (triplet) state. There is a sort of Connection inducing a 
correlation between the mutual distance of the two particles and the character 
of their spin state. Noticeably, this kind of Connection would fall apart if the 
two particles were not identical. We should note that symmetrization (antisym- 
metrization) can be seen as the result of continuous projection (at any time t) of 
the state of the two independent particles on a symmetrical (antisymmetrical) 
subspace. 

Let us discuss the notion that particle statistics symmetries can be due to 
projection. As another example, we consider a pair of identical bosons labeled 
1 and 2 ; S'12 = ^ (1 -I- P12) is the usual symmetrization projector; 0/1 stand for, 
say, horizontal/ vertical polarization. The symmetry S'12 W) = W) is satisfied in 

H, = span ||0)i lO)^ , |l)i ll)^ , ^ (|0), ll)^ + |1), . 

There is a common didactic way of introducing this kind of symmetry. First, 
statistics is disregarded and the particles are assumed to be independent of each 
other (like in the above case of the two free particles). Let their unentangled 
state at time t be I'F {t)) = «o | 0 )i |0)2 + | 0 )i |1)2 + *^2 |l)i |0)2 + 0:3 |l)i |1)2 . 

Second, statistics is recovered by symmetrizing |!F (t)) , namely by projecting it 
on Ht- We generalize the example of the free particles by taking this didactic 
procedure seriously: particle statistics is interpreted as the result of projection 
on a predetermined Hilbert subspace — the one satisfying the symmetry — of a 
system state which could otherwise be out of symmetry (as will be clarified in 
the following Sections). 

We shall discuss this interpretation preliminarly. It amounts to considering 
the equation 

Vt: 5 i 2 |'F(t)) = |'F(t)), ( 2 . 1 ) 

as a constraint applied to |iF (t)). When a particle statistics symmetry is an initial 
condition conserved as a constant of motion, this constraint is redundant. How- 
ever, the notion of Connection will be related to particle statistics by means of a 
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counterfactual reasoning based on eq. (2.1). The idea is that |'f' (t)), symmetrical 
at time ti , could be pushed out of symmetry at an immediately subsequent time 
t 2 > ti] but in this case eq. (2.1) would project it back on Ht- Particle statistics 
would operate like a special watch-dog effect internal to the endosystem, or like 
destructive and constructive interference, by killing the amplitudes of those ba- 
sis vectors (in a suitable reference) of \'F (t 2 )) which violate the symmetry, and 
reinforcing the other amplitudes through re-normalization. This can be seen as 
a continuous form of partial state vector reduction on a symmetrical subspace. 

To see why |>f"(t)) could be “pushed out of symmetry”, we consider the 
system defined in Section 1 and Ars projection. In a first step, Ars projection 
is disregarded while parts r and s are assumed to be independent of each other. 
An operation on part r could well push the overall state (t)) out of symmetry, 
but in a second step this is prevented by the continuous projection of |'f' (t)) on 
Hs'.yt: Ars 1^ (i)) = 1^ (i)) • ^rs will be related to A 12 in Section 5. 

We should note that this projection (or, if one prefers, state vector reduction 
on a predetermined subspace) will in general alter the entanglement between the 
parts r and s, thus the coherence elements of pr (t) (part r density matrix). How- 
ever, it does not alter the diagonal elements of pr (t); the diagonal is determined 
by the operation performed on part r, namely it is a constraint to be satisfied by 
projection. Furthermore, the distance between the vector before projection and 
the vector after projection should be minimum. Interestingly, this is like partial 
state vector reduction — from a Hilbert space Hyj to a subspace Hg C Hyj. The 
outcome of reduction can be obtained by submitting a free vector of Hg to the 
condition that its distance from the vector before reduction is minimuml^^l. As a 
matter of fact, it is like particle statistics induced a continuous form of (partial) 
state vector reduction on a predetermined subspace. 



3 Behaviour of the Quantum Mechanical Connection 



Given the Connection r,s defined in Section 1, let us consider an operation 
performed on just one qubit, say r. Let this be the continuous rotation Qr {(fi) = 
cos(/?|0),, (0|^ -sin(/?|0),, (1|,, -I- sin (/? |1)^ (0|,, -I- cos (/? |1),, (1|^, with p = cot and t 
going from 0 to We shall examine the effect of applying Qr (p) to qubit r, 

Pr{t) = Qr{ujt) Pr{0)Ql(ujt) , (3.1) 

under continuous Arg projection of the overall state. 

Let the Connection initial state be the “symmetrical” state (whose tensor 
products satisfy symmetry Ars): 

\W (0)) = cost 9 |0)^ |1)^ -k sint9 |1)^ |0)^ . (3.2) 

Successive states are obtained by submitting a free normalized vector |<F (t)) of 
the Hilbert space Hru (Section 1) to the mathematically simultaneous conditions: 
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for all t {or (p): 

i) Ars\^{t)) = (i)); 

ii) Pr (t) = Trs{\1'{t)) i'I'{t)\) = cos^ (t? + (/?) |0)^ (Ol^ + sin^ + </?) |l)r (llr-5 

means partial trace over s. Under condition (i), |!f' (t)) has the form a (t) |0)^ |l)s+ 
(3{t) |1)^ |0)^, thus Pr (t) is always a diagonal matrix: its coherent elements are 
killed by Ars projection (or reduction); 

iii) the distance between the vectors before and after projection is minimum. 
Since projection is continuous, || ( 'f' (t) | (t + At)) || must be maximized orderly 
for t = 0, t = At, t = 2At, ..., t = tVAt, where At = then the limit 
for N —>■ oo must be taken (however, maximization ordering turns out to be 
irrelevant). 

Conditions (i) and (ii) yield |!f' (t)) = cos {ilf+p) |0),, |l)s+e*'^ sin (-d + p) |1)^ |0)^ , 
with S unconstrained, as can be checked; condition (iii), given the initial state 
(3.2), sets d = 0, yielding to the unitary evolution {p = cot): 

\<F (t)) = cos (it + p) 10),, |1)^ + sin {d + p) |1),, |0)^ . (3.3) 

This makes a “good” Connection. Qubit r rotation is transmitted to s: 



Trr (|if (Q) {'P (t)|) = Ps (t) = sin^ (d + p) |0)^ (Ol.+cos^ (d + p) |1)^ (1|^ . (3.4) 

Eigenvalues 0 and 1 are interchanged since one qubit is the NOT of the other. 
Noticeably, by simultaneously rotating the other extremity s of the Connection 
by the same amount, the same result (3.3) is obtained. This means adding eq. 
(3.4) as a condition, but this is redundant with respect to (i) and (ii), it was 
derived from (i) and (ii). Whereas, two different rotations of the two Connection 
extremities give an impossible mathematical system; this is a rigid Connection. 

It should be noted that a rotation p of qubit (part) r under Ars projection, 
is equivalent to applying the unitary operator Q (p) to the overall state I'P (t)): 

/ cos (/? sin 0 0 \ 

^ , — sin w cos p 0 0 

Q(p)= . , 

' ^ 0 0 cos p — sin p 
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0 sirup cosp I 


with |0)J1),, = 


0 

0 


,|l).|0),^ 


1 

1 




V) 




V) 



It can be seen that Q {p) operates on the overall state in an irreducible way, 
bringing it from I'F (0)) (3.2) to I'F (t)) (3.3) without ever violating Ars- We have 
thus ascertained a peculiar fact. Our operation on a part, blind to its effect on the 
whole, performed together with continuous Ars projection, generates a unitary 
transformation which is, so to speak, wise to the whole state, to how it should 
be transformed without violating Ars- Of course Ars ends up commuting with 
the resulting overall unitary propagator (shaped by it). 




194 G. Castagnoli and D. Monti 



4 Quantum Computation Networks 

Let us consider the reversible Boolean network of fig. 2(a), fully deployed in space 
— time is orthogonal to the network lay-out. This is different from sequential 
computation, where the Boolean network appears in the computation space-time 
diagram. 



t 



V 



O 



O 



u-l 






o 

r 



O 
s = 1 




(a) 

FIGURE 2. 



(b) 



Nodes t, u and v, r make the input and the output of a controlled NOT; 
r and s belong to a Connection. The c-NOT is made up of the four coexisting 
qubits t, u, V, r; it has four eigenstates which map the gate Boolean relation and 
constitute the basis of Hg = 

span{\0), |0)„ |0)„ |0), , |0), |1), |0)„ |1), , |1), |0)„ |1), |1), , |1), |1)„ |1), |0) J. 

Model Hamiltonians of such gates are given in [2]; they are different from time- 
sequential gates where the input and output are successive states of the same 

register. 5. 6] 

The satisfiability problem is stated by constraining part of the input and 
part of the output (just one output qubit is sufficient), and asking whether this 
network admits a solution. Let u = 1 and s = 1 be such constraints, u = 1 
(s = 1) propagates a conditional logical implication from left to right (right to 
left). Logical implication is conditioned by the values of the unconstrained part 
of the input (output). To have a solution, the two propagations must be matched, 
i.e. they must generate a univocal set of values on all the nodes of the network. 
Finding whether the network admits at least one match (one solution) is an NP- 
complete problem. Possible collisions (mismatch) between the two propagations 
will be both overcome and reconciled by the Connection. 

Let us assume that the network has just one solution (which is the case here: 
t = 1, M = 1, r = 0, u = 1, s = 1). The procedure to find it is as follows (this will 
hold for a generic network, thus we can think of many gates and Connections — 
see fig. 2b, where each wire should incorporate a NOT function and stands for 
a Connection). The output constraint is removed while an arbitrary value, here 
t = 0, is assigned to the unconstrained part of the input. The logical propagation 
of this input toward the output yields t = 0,u=l, r=l, s = 0{v = t will be 
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understood) . This computation is performed off line in polynomial time (in the 
network size). It serves to specify the initial state in which the network must be 
prepared: |'f' (0)) = |0)j 11)^^ |1)^ |0)^. This state satisfies the gate/s and the Con- 
nection/s, but qubit s is in |0)^ (0|g rather than |1)^ (Ij^ (the output constraint). 
It will be continuously rotated from |0)g (Oj^ to |l)g (Ij^ under Ars projection, 
while keeping = |1)^ (1|^ fixed. This transformation operates on the network 
Hilbert space here = Hg ® Hg where Hg = span {|0)^ , ll)^}. Note that 
all states of Hn natively satisfy the gate/s, not necessarily the Connection/s. 

At any time t, the state of the network is obtained by submitting a free 
normalized state |'f' (t)) of Hn to the conditions: 

for all t: 

i) Arg\^{t)) = (t)); 

ii) Trt,u,r il'l' (t)) (if (t)l) = Ps (t) = cos^ p|0)^ (0|^ + sin^ p |1)^ (1|^ , with p = wt 
and t going from 0 to 

iii) Trt^r.s (1?^ (t)) {'P (t)|) = Pu (0) = |1)„ (1|„; in a generic network there might 
be more conditions of this kind; 

iv) the distance between the vectors before and after projection is minimum as 
specified in Section 3. 

This yields: 

IW (t)) = cosp |0)j |1)„ |1)^ |0)^ + e*'^sinp |l)j |1)„ |0)^ |1)^ , (4.1) 

as is readily checked. For p = |, one obtains \'l' {-^)) = |l)t |l)u |0)r |l)s 5 
namely the solution. 

Transformation (4.1) brings the state of the network from satisfying only the 
input to satisfying both the input and the output constraints. It is obtained by 
“blindly” operating on divided parts of the network, but under Arg projection/s 
(the conquering factor) . 

If 5 does not change with time, it can be seen that propagation (4.1) is 
unitary. Let us further discuss this point; more generally, we consider a network 
admitting one or more solutions. The generic vector of Hn satisfying (i) through 
(iii) can be written sorting out solutions and non-solutions: 

( L-l M-l \ 

cosp^tti |i)g |0)^-ksinp a*l*)Q|l)J |1)„ 

i=0 i=L / 

L-l 2 2 

with l|c«j|l = X) l|o:i|| = 1. 1 1)„ ... denotes the tensor product of the con- 

i—0 i=L 

strained input qubits; s is the output qubit, whose density operator should be 
brought to |1)^ (1|^; |i)g denotes a tensor product of all other network qubits 
((i|g |j)g = Sij). The network tensor products corresponding to the first sum- 
mation do not satisfy the output constraint. Those of the second summation 
satisfy it. 

Let |0)g |0)g |1)„ ... be the network preparation. At p = 0+, “immediately 
after” starting qubit s rotation and for 0 ^ i ^ L — 1, condition (iv) sets 
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oii = 5o,i whereas, for L ^ i ^ M — 1, the are left unconstrained but for 
the above normalization condition. If one takes a random choice of them, this 
choice must stay frozen throughout ps (t) rotation, because of (iv) as is readily 
seen. Measurement at v? = f yields a solution |fc)g |l)g |1)„ ... {L ^ k ^ M — 1). 
Assuming that state vector reduction can be placed everywhere from preparation 
to measurement, if one prefers it could be located “immediately before” (/? = 0+. 
In this case qubit s rotation would rise from the beginning just one, randomly 
chosen, solution (namely, Oj = Sk.i for L ^ ai ^ M — 1). Anyhow, for ^ 0+ 
the network state can be written 

|!f (t)) = cos (/? |0) J l)p + sin V? 1 1) J0)p , 

where |0)p, |l)p are two constant (in time) orthonormal vectors of Hg. For a 
constant S (randomly chosen once for all), (t)) is a unitary evolution in the 
subspace Hg Hp, with Hp = span{\0)p , |l)p}. 

As a result of the foregoing process, Ars symmetries (or projectors) become 
constants of motion which commute with the network propagator at all times. 
They are also pairwise commuting, being applied to disjoint Hilbert spaces. 
However, the cause should not be confused with the effect. Ars projections shape 
or forge the unitary propagator with which they commute. To sum up, if the 
network admits one or more solutions, measurement at ^ ^ gives one of them 

(that it is a solution is checkable in polynomial time) . If the network admits no 
solution, conditions (i) through (iv) make up an impossible system. Measuring 
the network final state ~ at t = ^ — gives a non-solution. This is checkable in 
polynomial time and tells that the network is not satisfiable. 

It is clear from the above that Connections “cut” network complexity, in- 
ducing a divide-and-conquer strategy. This diakoptic approach would make NP- 
complete = P. However, we have been applying reverse engineering: until now the 
Ars projections are just a nice-to-have feature. This raises the problem whether 
this feature can be physical. 

5 Induced Symmetry 

Ars symmetry will be shown to be an epiphenomenon of fermionic antisym- 
metry in a special physical situation. This is generated by submitting a couple 
of identical fermions 1 and 2 to a suitable Hamiltoniant^^l . We assume that 
each fermion has two compatible, binary degrees of freedom y and A. Just for 
the sake of visualization (things should remain more abstract), we can think 
that each fermion is a spin 1 /2 particle which can occupy one of either two 
sites of a spatial lattice, y can thus become the particle spin component Uz 
(y = 0, 1 correspond to CTz = down, up) and X = r,s the label of the site occu- 
pied by the particle. For example, |0)^ |1)2 |r)j^ |s)2 reads: CTz of particle 1 down 
(0), CTz of particle 2 up (1), site of particle 1 = r, site of particle 2 = s. There 
are 16 combinations like this, which make up the basis of the Hilbert space 
H\^. However, there are only six antisymmetrical combinations (not violating 
statistics) which make up the basis of the “symmetrical subspace” of (the 
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pun should not be misleading). These basis vectors are represented in first and 
second quantization and, when there is exactly one particle per site, in qubit 
notation ((T 2 /A stand for the qubit eigenvalue/label): 
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creates a particle of spin x in site creation/ annihilation operators form 
the algebra: = WiyCij} = 0 , Now we introduce the 

Hamiltonian Hrs = Ea |a) (a| + Eb \b) {b\ + Ec |c) (c| + Ed |d) {d\ or, in second 

quantization Elrs = -(Ea al^aorOir + Ei, al^aosais +Ec aQ^aorOos + 
Ed a\r aLoirOis), with Ea, Eb, Ec, Ed > E discretely above 0. This leaves us 
with two degenerate ground eigenstates: 

|e) = ^ (| 0 ), | 1 ), + | 1 ), | 0 ) J and |/) = ^ (| 0 ), | 1 ), - | 1 ), | 0 )J . 

Alternatively, their linear combinations |0)^ |l)g and |1)^ |0)^ can be used as the 
two orthogonal ground eigenstates. The generic ground state is thus: 

\E) = a 10),, 11),, + /3 11),. |0)„ , with |a|^ + |/3|^ = 1, (5.1) 

which satisfies Ars symmetry. Let A 12 \'E) = 5(1 — P 12 ) be the antisymmetriza- 
tion projector. Due to the anticommutation relations: A 12 |0), |1),, = |0),. |1),, and 
Ai2 |1), |0), = | 1 ), | 0 ),; also, A12 | 0 ), | 0 ), = | 0 ), | 0 ), and A, 2 | 1 ), | 1 ), = | 1 ), | 1 )„ 

provided that | 0 ), | 0 ),, = |c) and | 1 ),. | 1 ),, = \d) (see further below) and without 

forgetting that these are excited states. 

The Connection can be implemented by suitably operating on the ground 
state (5.1). We assume that the initial, “symmetrical” state of the Connection 
is given by eq. (3.2): \E (0)) = cost? |0), |1),, + sind |1), |0),,. Then transforma- 
tion (3.1) [pr (t) = Qr {ojt) Pr (0) Ql (wt)] is applied to qubit r under continuous 
Ars projection. Let (t)) be a free normalized vector of i?Ax- The Connection 
state at time t is obtained by submitting \E (t)) to the following mathematically 
simultaneous conditions. 



for all t: 

i) Ai2\E{t)) = \E{t))-, 

ii) Pr (t) = Tts {\E (t)) {E (t)|) = cos^ + if) 10),. (0|, -k sin^ (d -k p) |1),. (1|,.; 
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iii) the distance between the vectors before and after projection is minimum as 
specified in Section 3; 

iv) the expected Connection energy, (t)) = {'P {t)\ Hrs\'l' {t)) , is minimum. 
Since this minimum will always be zero, time ordering is irrelevant. 

It is readily seen that the solution of this system is still \'I/ {t)) of eq. (3.3): 

\W (t)) = cos (i9 + ip) |0)^ |1)^ + sin (§ + p) |1)^ |0)^ . 

Simultaneous satisfaction of (i), i.e. fermionic antisymmetry seen as projection, 
and (iv) (which is satisfied by (^ (t)) = 0) originates the projection constraint 
Ars \'P {t)) = I'f' (t)), as is readily seen. Therefore, if = 0, namely if the 

operation on qubit r is performed adiabatically, we obtain the Connection. Since 
this computation is reversible ^'^ namely it does not dissipate free energy (the 
result of driving and shaping is a unitary evolution), in principle the operation 
can be adiabatic and (^ (t)) can always be zero. This is of course an idealization: 
actually we are highlighting a speculative, possible way of dealing with NP- 
complete problems. 

By the way we should note that the tensor products |0)^ |0)g and |1)^ |1)^ that 
would be projected off since they violate A^s symmetry (see the counterfactual 
reasoning of Section 2), are not the antisymmetrical excited states |c) and |d) 
satisfying A 12 . They would be instead the symmetrical states of 

|c)' = |0) JO), = ^ |0) JO)^ (|r) + k) Jr)^) , 

M)' = |i). |i). = ^ |i)i 11)2 (k)i 1^)2 + 1^)1 k) 2 ) ■ 

Let us consider the first equation. |c) has the same qubit notation as |c) although 
(c| c) = 0. Let 7 (7 ) be the amplitude of |c) (|c) ); a is the amplitude of 

2 2 ' ^ 

|0)^ |1),. The first diagonal element of Pi- ( t) would thus be ||a|| + UH + 7 
The assumption is that pr (t) rotation does not raise UH (initially zero) and 
consequently the energy, but 7 which would be immediately killed by A 12 
projection, thus always remaining zero as it should be. The same can be said for 
|d) and the second diagonal element of pr (t). 

Let us address the problem of creating many Connections, namely an Hrs 
Hamiltonian per network wire r, s (fig. 2a). These Hrs operate on disjoint pairs 
of qubits. Viewed as Ars projectors (which is the case when (^ (t)) = 0), they 
are pairwise commuting. Still in the idealized case of adiabatic operation, the 
Connections operate independently of each other. 

6 Conclusion 

The notion of applying a particle statistics symmetry (or projection) to divide 
the quantum whole into parts without clipping its richness — here computation 




A Diakoptic Approach to Quantum Computation 199 



speed-up[®’^°’^^’ among others] _ jntroduces an engineering (diakoptic) perspective 
in the design of quantum mechanisms. For the time being, this notion is devel- 
oped at an abstract level. This paper is the exploration of a possible, alternative 
form of quantum computation. Finding model Hamiltonians which implement 
the Hermitean matrix of Section 5 could possibly be the next step. 

The interpretation of particle statistics symmetry as projection on a prede- 
termined subspace is best modeled in a two-way (advanced and retarded in time) 
propagation scheme^^^’^^’^^l. 

Part of this work was completed during the 1997 Elsag Bailey I.S.I. Foun- 
dation research meeting on quantum computation. Thanks are due to A. Ekert, 
D. Finkelstein, L. Levitin, S. LLoyd, C. Macchiavello and T. Toffoli for useful 
suggestions. 
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Abstract. An experimental free-space quantum key distribution (QKD) 
system has been tested over an ontdoor optical path of ~ 1 km under 
nighttime conditions at Los Alamos National Laboratory. This system 
employs the Bennett 92 protocol; here we give a brief overview of this 
protocol, and describe our experimental implementation of it. An anal- 
ysis of the system efficiency is presented, as well as a description of our 
error detection protocol, which employs a two-dimensional parity check 
scheme. Finally, the susceptibility of this system to eavesdropping by 
various techniques is determined, and the effectiveness of privacy ampli- 
hcation procedures is discussed. Our conclusions are that free-space QKD 
is both effective and secnre; possible applications include the rekeying of 
satellites in low earth orbit. 



1 Introduction 

Quantum cryptography was introduced in the mid-1980s [Q as a new method for 
generating the shared, secret random number sequences, known as cryptographic 
keys, that are used in crypto-systems to provide communications security. The 
appeal of quantum cryptography is that its security is based on laws of Nature, 
in contrast to existing methods of key distribution that derive their security from 
the perceived intractability of certain problems in number theory 0, or from 
the physical security of the distribution process. 

Since the introduction of quantum cryptography, several groups have demon- 
strated quantum key distribution (QKD) over multi-kilometer distances of opti- 
cal fiber 0-HI3), and recent advances have led to demonstrations of QKD over 
free-space indoor optical paths of 205 m nn, and outdoor optical paths of 75 m 
m These demonstrations increase the utility of QKD by extending it to line- 
of-site laser communications systems. Indeed there are certain key distribution 
problems in this category for which free-space QKD would have definite prac- 
tical advantages (for example, it is impractical to send a courier to a satellite). 
We are developing QKD for use over line-of-sight paths, and here we report our 
results of free-space quantum key generation over outdoor optical paths of up to 
950 m under nighttime conditions. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 200-ESl 1999- 
© Springer- Verlag Berlin Heidelberg 1999 
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2 Quantum-Key Distribution 

The faithful transmission of polarized single photons through a turbulent me- 
dium (the atmosphere), receiving them with non-negligible probability and de- 
tecting them against a high ambient background, appear to be serious obstacles 
to free-space QKD. However, these obstacles can be overcome by exploiting sub- 
nanosecond timing techniques, narrow wavelength filters mm spatial filtering 
n, and adaptive optics To define the problem, we will require the gener- 
ation of ~ 1, 000 secret key bits between a ground station and a low-earth orbit 
satellite (~ 300 km altitude) in one overhead pass (duration ~ 8 minutes). In 
the following analysis we will assume that the QKD transmitter (Alice) is at the 
ground station and the receiver is on the satellite (Bob). 

2.1 Free- Space Single Photon Detection and Transmission 

The operational wavelength for free-space QKD should be chosen for both good 
atmospheric transmission properties and high detection efficiency. We have cho- 
sen to work at 772 nm where the atmospheric transmission from surface to space 
can be as high as 80% and single-photon detectors with efficiencies as high as 
65% are commercially available (silicon avalanche photodiodes: APDs). Further- 
more, at these optical wavelengths depolarizing effects of atmospheric turbulence 
are negligible as is the amount of Faraday rotation experienced on a surface to 
satellite path. 

In order to detect a single QKD photon it is necessary to know when it will 
arrive. However, there will be variations in transmission time through the at- 
mosphere owing to turbulence induced variations in refractive index, with time 
scales of the order of 0.01 — 0.1s. Therefore, the photon arrival time can be com- 
municated to the receiver by using a bright (multi-photon) precursor reference 
pulse, transmitted 100 ns (say) ahead of each QKD photon. The bright pulse 
and the “single photon” (produced by highly attenuating the pulsed output of 
a semiconductor laser) would each be of a ~ 100-ps duration. (Note: the tempo- 
ral length of the bright pulse is not as restricted as the temporal length of the 
dim-pulse; in fact, the bright pulse only needs to be short enough to allow the 
detection of the bright- and dim-pulses within the time allowed by the trans- 
mission rate. We also note that the atmosphere is only weakly dispersive.) The 
received bright pulse would then allow the receiver to set a 1-ns time window 
(say) within which to look for the QKD photon. This short time window would 
reduce background photon counts dramatically, and these can be reduced fur- 
ther using narrow filters at the wavelength of the QKD photons as well as spatial 
filtering. For example, 1-nm interference filters can be used and even narrower 
atomic vapor filters (~ 10“^ nm) are possible. 

We now consider the rate at which QKD photons would be received at a 
satellite from a ground station transmitter. We will assume 20-cm diameter 
optics at both the transmitter and satellite receiver, leading to a ~ 1-m diameter 
diffraction-limited spot size at the 300-km altitude satellite. However, there will 
be beam-wander owing to turbulence which can be as much as ^ 10 times the 



202 R.J. Hughes et al. 



diffraction limit (i.e., 10 arc-seconds of wander) so that the photon collection 
efficiency at the satellite is ~ lO”"*. Thus, with a laser pulse rate of 10 MHz, 
one photon-per-pulse on average and an atmospheric transmission of ~ 80%, 
photons would arrive at the detector at a rate of ~ 1 kHz. Then, with a 65% 
detector efficiency and allowing for the 25% intrinsic efficiency of the quantum 
cryptography protocol, a key generation rate of ~ 150 Hz is feasible. With a 
beam tilt feedback system to keep the beam directed onto the satellite the key 
rate could be increased by a factor of 100. We must also consider the error rate. 

We first consider errors arising from background photons arriving at the 
satellite. On a night time orbit with a full moon a typical radiance observed at 
the satellite at the transmission wavelength would be ~ 1 mW m~^ str“^ /im“^ 
or ~ 4 X 10^® photons s“^ m~^ str“^ /xm“^. On a night with a new moon we take 
the background to be ~ 10^^ photons s“^ m“^ str“^ /im“^. We will assume that 
the receiver “sees” a solid angle ~ five times the apparent size of the source (i.e., 
5 arc-seconds) and that there is a 1-nm bandwidth interference filter placed in 
front of the detector, giving a background photon arrival rate of ~ 150 Hz (full 
moon); and ~ 4 Hz (new moon). With a 1-ns long time window on the detector, 
the probability of a background photon detection would be ~ 10“^ (full moon); 
and ~ 2.5 x 10“® (new moon) per 1-ns window. The single photon detector would 
only be triggered for precursor bright pulses that impinge on the satellite, giving 
approximately 120 detector triggers per arriving QKD photon, or 800 detector 
triggers per detected QKD photon. The bit error rate (HER) from background 
photons would therefore be ~ lO”"* (full moon); and ~ 2 x 10“® (new moon). 
Assuming a detector dark count rate of 50 Hz the HER will be dominated by 
background photons during full moon periods, and by detector noise during a 
new moon, with a HER ~ 5 x 10“^. 

On daytime orbits the background radiance would be very much larger, ~ 
10^^ photons s“^ m“^ str“^ /xm“^. Nevertheless, with an atomic vapor filter the 
rate of arrival of background photons would only be ~ 40 kHz (assuming a 10“^ 
nm filter width). The HER from this background would then be ~ 2%. 

From this simple analysis we see that QKD between a ground station and 
a low-earth orbit satellite should be possible on night time orbits and even in 
full daylight. During the several minutes that a satellite would be in view of the 
ground station there would be adequate time to generate tens of thousands of 
raw key bits, from which a shorter error-free key stream of several thousand bits 
would be produced after error correction and privacy amplification. A crypto- 
graphically useful quantity of key material could therefore be generated for this 
application. 

2.2 The Bennett 92 Protocol 

A QKD procedure starts with the sender, “Alice,” generating a secret random bi- 
nary number sequence. For each bit in the sequence, Alice prepares and transmits 
a single photon to the recipient, “Bob,” who measures each arriving photon and 
attempts to identify the bit value Alice has transmitted. Alice’s photon state 
preparations and Bob’s measurements are chosen from sets of non-orthogonal 
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Table 1. Observation Probabilities 



Alice’s Bit Value 
Bob Tests With 


“0” 

“1” 


“0” 

“0” 


“1” 


“ 1 ” 

“0” 


Observation Probability 


p=0 


P= I 


P= 1 


p=o 



possibilities. For example, using the B92 protocol m Alice agrees with Bob 
(through public discussion) that she will transmit a horizontal-polarized pho- 
ton, \h), for each “0” in her sequence, and a right-circular-polarized photon, |r), 
for each “1” in her sequence. Bob agrees with Alice to randomly test the po- 
larization of each arriving photon with vertical polarization, |u), to reveal “Is,” 
or left-circular polarization, |£), to reveal “Os.” In this scheme. Bob will never 
detect a photon for which he and Alice have used a preparation/measurement 
pair that corresponds to different bit values, such as \h) and |?;), which happens 
for 50% of the bits in Alice’s sequence. However, for the other 50% of Alice’s bits 
the preparation and measurement protocols use non-orthogonal states, such as 
\h) and \£), resulting in a 50% detection probability for Bob, as shown in Table 
□ Thus, by detecting single-photons Bob identifies a random 25% portion of the 
bits in Alice’s random bit sequence, assuming a single-photon Fock state with 
no bit loss in transmission or reception. This 25% efficiency factor is the price 
that Alice and Bob must pay for secrecy. 

Bob and Alice reconcile their common bits through a public discussion by 
revealing the locations, but not the bit values, in the sequence where Bob de- 
tected photons; Alice retains only those detected bits from her initial sequence. 
The resulting detected bit sequences comprise the raw key material from which a 
pure key is distilled using classical error detection techniques. The single-photon 
nature of the transmissions ensures that an eavesdropper, “Eve,” can neither 
“tap” the key transmissions with a beam splitter (BS), owing to the indivisibil- 
ity of a photon El, nor copy them, owing to the quantum “no-cloning” theorem 
| IT^ . Furthermore, the non-orthogonal nature of the quantum states ensures that 
if Eve makes her own measurements she will be detected through the elevated 
error rate she causes by the irreversible “collapse of the wavefunction PI.” 

2.3 Quantum-Key Transmitter: Alice 

The QKD transmitter for our experiments (Fig. consisted of a temperature- 
controlled single-mode (SM) fiber-pigtailed diode laser, a fiber to free-space 
launch system, a 2.5-nm bandwidth notch-filter, a variable optical attenuator, a 
polarizing beam splitter (PBS), a low- voltage Pockels cell, and a 27x beam ex- 
pander. The diode laser wavelength is temperature adjusted to 772 nm, and the 
laser is configured to emit a short, coherent pulse of approximately 1-ns length, 
containing ~ 10^ photons. 

A computer control system (Alice) starts the QKD protocol by pulsing the 
diode laser at a rate previously agreed upon between herself and the receiving 
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Fig. 1. QKD Transmitter. 

computer control system (Bob). Each laser pulse is launched into free-space 
through the notch filter, and the ^ 1 ns optical pulse is then attenuated to an 
average of less than one photon per pulse, based on the assumption of a statistical 
Poisson distribution [2()j . (The attenuated pulse only approximates a “single- 
photon” state; we tested the system with averages down to less than 0.1 photon 
per pulse. This corresponds to a 2-photon probability of < 0.5% and implies 
that less than 6 of every 100 detectable pulses will contain 2 or more photons, 
i.e., for a Poisson distribution, P", with an average photon number of n = 0.1, 
for every 1000 pulses there will be ~ 905 empty pulses, ~ 90 pulses of 1 photon, 
^ 5 pulses of 2 photons, and ~ 1 pulse of 3 or more photons.) The photons 
that are transmitted by the optical attenuator are then polarized by the PBS, 
which transmits an average of less than one \h) photon to the Pockels cell. The 
Pockels cell is randomly switched to either pass the “single-photon” unchanged 
as \h) (zero- wave retardation) or change it to |r) (quarter- wave retardation). The 
random switch setting is determined by discriminating the voltage generated by 
a white noise source. 



2.4 Quantum-Key Receiver: Bob 

The free-space QKD receiver (Fig. 0 comprised a 8.9 cm Cassegrain telescope 
followed by the receiver optics and detectors. The receiver optics consisted of a 
50/50 BS that randomly directs collected photons onto either of two distinct op- 
tical paths. The lower optical path contained a polarization controller (a quarter- 
wave retarder and a half-wave retarder), adjusted as an effective quarter- wave 
retarder, followed by a PBS to test collected photons for \h) (at first glance this 
may be confusing, but the effective quarter wave retarder converts \h) to |r) lead- 
ing to a 50% probability an |/i) photon will be detected); the upper optical path 
contained a half-wave retardeiQ followed by a PBS to test for |r) (again, perhaps 

^ A polarization controller was not required along the upper path because the 50/50 
BS trausmitted the P polarization (the component of polarization parallel to the 
plane of incidence) without introducing any phase shift, but the quarter- and half- 
wave retarder pair was necessary along the lower path because the BS reflected the 
P and S (component of polarization normal to the plane of incidence) polarizations 
differently, introducing some ellipticity to the reflected wave. 
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confusing, but an |r) photon traveling this path is converted to \l), resulting in a 
50% detection probability). The output port along each optical path was coupled 
by multi-mode (MM) fiber to a single-photon counting module (SPCM: EG&G 
part number: SPGM-AQ 142-FL). [Although the receiver did not include notch 
filters, the spatial filtering provided by the MM fibers effectively reduced noise 
caused by the ambient background during nighttime operations to negligible 
levels (the background was ^1.1 kHz).] 

Bit values are determined in the following fashion: a single jr) photon travel- 
ing along the lower path encounters the polarization controller, and is converted 
to I?;) and reflected away from the SPGM by the PBS, but a single \h) photon 
traveling the same path is converted to jr) and transmitted toward or reflected 
away from the SPGM in this path with equal probability; in contrast, a single \ h) 
photon traveling the upper path is converted to ju) and reflected away from the 
SPGM in this path, but a single jr) photon traveling this path is converted to \l) 
and transmitted toward or reflected away from the SPGM with equal probability. 

In this detection scheme, there are a total of four possible optical paths 
through the receiver, but only two of the paths, those which terminate upon 
the detectors seen in Fig. |21 contain definite polarization information (definite 
in the sense that Bob can know what polarization Alice has transmitted if one 
of these detectors fire). However, while the remaining two paths contain inde- 
terminate polarization information (indeterminate in the sense that Bob cannot 
know with certainty whether Alice has transmitted jh), or jr) if a detector placed 
in either of these paths fires), but this information is important for the secure 
implementation of B92, as will be seen later (see Sec EH). 



3 Outdoor Free-Space Experiments 

The transmitter and receiver optics were operated over 240-, 500-, and 950-m 
outdoor optical paths, with the transmitter and receiver collocated in order to 
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simplify data acquisition. The various total optical path lengths were determined 
by positioning a 25.4 cm diameter mirror at the transmission distance half way 
point that reflected the transmitted beam back to the receiver. All measurements 
were made at night. 

3.1 System Efficiency 

In determining Bob’s bit-rate, we consider that a BS partitions a weak photon 
stream in a binomial fashion pnj. We further assume that the effective wave re- 
tarders, combined with the BBSs, behave together as 50/50 BSs when analyzing 
non-orthogonal polarizations, i.e, if Alice transmits \h) and Bob analyzes with 
\tj, or if Alice transmits |r) and Bob analyzes with |u). In addition, we treat 
the detectors as BSs with transmission coefficient Tjj = 0.65, or in other words, 
that the detector, with efficiency r]D = 0.65 = To, also detects photon streams 
in a binomial way. We also treat the transmission and reception efficiency r ] — 
or power losses between the transmitter and receiver together with the losses 
which occur coupling power into the receiver’s MM fibers — as random binomial 
processes. From the general form of the binomial probability distribution, we 
have 

n 

Pi, ^ , (1) 

m—l 

the probability that at least 1 photon out of n photons will be transmitted 
through the optical elements along the optical path (this is important because 
the detector responds to one or more photons) . The net transmission probability 
is T, the reflection probability is R, and T + R = 1. 

For calculation purposes, we use Eq. |3 which is equivalent to Eq. 0 

= 1 - (1 - ,7 ■ 77 ,, ■ 1/2 • l / 2 )^ ( 2 ) 

where T ^ p ■ po/^, V and po are as previously defined, and the factor of 
1/4 = 1/2 • 1/2 gives the probability that a photon collected at the receiver, 
of either \h), or |r), will be transmitted through the 50/50 BS followed by the 
effective quarter- wave retarder and PBS (for an \h) photon), or the half-wave 
retarder and PBS (for an |r) photon). 

These binomial expanded products (Eq. 0 of p, 1/4, and po, are convolved 
with the Poisson probabilities, that there will be exactly n photons in a pulse 
given that the average number of photons per pulse is n: 

r.n exp (-h) 

” “ n! ■ ^ ^ 

The convolution is then summed to give the detection probability as a func- 
tion of the Poisson average photon number. This probability multiplied by the 
rate at which Alice transmits the coherent pulses, TZa, gives the rate at which 
Bob detects Os and Is, TZb- 

OO 

Rb = Ra [1 - (1 -PPdI^T] • 

n—1 



(4) 
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Table 2. A 200-Bit Sample of Alice’s (A) and Bob’s (B) Raw Key Material 
Generated by QKD over 1 km. 



A 


0000010101 


1101101001 


0000000000 


0110010101 


B 


0000010101 


1101101001 


0000000000 


0110010101 


A 


0011100010 


0111011101 


1110111000 


0100100011 


B 


0011100010 


0111011101 


1110111000 


0100100011 


A 


1110000000 


0101101111 


1001001010 


0010000011 


B 


1110000000 


0101101111 


1001001010 


0010000011 


A 


0000010111 


0000111111 


1111000000 


1010101101 


B 


0000010111 


0000111111 


1101000000 


1010101101 


A 


1111100111 


1110111101 


0100110100 


1011101111 


B 


1111100011 


1110111101 


0100110100 


1011101111 



Our experimental result was 7 ?.b ^ 50 Hz when the transmitter was pulsed at a 
rate of TZa = 20 kHz, with n = 0.1 photon per pulse for the 950-m path. 

Finally, we note that in the limit that 77 • i— > 1, and given a Fock state 
of m = 1 photon, then the photon probability distribution P" Sm-i-, he., 
5m-i = OVm 1. In this limit — the limit of a perfect, lossless system — the sum 
vanishes and we are left with exactly 1 term TZb = P-a/ 4, which shows that Bob 
and Alice sacrifice 75% of their bits for privacy in agreement with Sec. 12.21 

3.2 Error Rate 

The bit error rate (BER) for the 950 m path was ^ 1.5% when the system was 
operating down to the <0.1 photon per pulse level, where the BER is defined 
as the ratio of the bits received in error to the total number of bits received. A 
BER of ~ 0.7% was observed over the 240-m optical path and a BER of 1.5% 
was also observed over the 500 m optical path. A sample of raw key material 
from the 950-m experiment, with errors, is shown in Table |5| 

Spatial filtering reduced the ambient background (~ 1.1 kHz), and the narrow 
gated coincidence timing windows (~ 5 ns) reduced bit errors caused by the 
ambient background to less than ~ 1 every 9 s. Further, because detector dark 
noise (~ 80 Hz) contributed only about 1 dark count every 125 s, we believe that 
the BER was caused by misalignment and imperfections in the optical elements 
(wave-plates and Pockels cell). 

3.3 Error Detection 

Our experiments implement a two-dimensional (2D) parity check scheme that 
allows the generation of error-free key material. Error detection is accomplished 
by Bob and Alice organizing their reconciled bits (see Sec. r2.2l) into 2D square 
matrices in the order that they were detected. Once organized, the parities of 
the rows and columns are determined and openly exchanged between Alice and 
Bob, and any column or row in which Bob and Alice possess different parities 
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is discarded. To ensure privacy, Alice and Bob also discard the bits oriented 
along the diagonals of their matrices. This guarantees the elimination of two 
bits for each row and column of the matrix, even when no errors are detected, 
eliminating knowledge revealed during the parity exchange. 

Figure 0 illustrates the error detection protocol. In this example, Alice pos- 
sesses the ‘good’ bits, and it is necessary for her and Bob to remove his ‘bad’ 
bits and distill error free key material. Bob possesses only two bad bits, but af- 
ter openly communicating the column and row parities, they sacrifice good bits 
along the diagonals, and the 2 rows and 2 columns where parity differences were 
seen (parity differences are seen in columns 3 and 6 and rows 3 and 6). The net re- 
sult, in this example, is 24 error-free bits: key := {100000110111110000010111}. 
Thus, in addition to the minimum 75% key lost during the B92 protocol. Bob 
and Alice have sacrificed another 62.5% of the detected bits. More complicated 
error detection codes could be employed to detect these as well, such as cyclic re- 
dundancy codes 1221 , but this was not done in our proof of principle experiment. 
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Fig. 3. Two-dimensional parity check scheme. 



This is not the whole story, because the detection protocol does not detect 
all errors. For example, 2 errors in a column, combined with another error in 
a row containing one of the column errors (an ‘L’ shaped pattern), results in 
a missed bit-error. If there were 4 errors in a ‘box’ pattern, none of the errors 
would be detected, and so on. 

We must emphasize, however, the strengths of the 2D routine as well. For 
example, the minimum Hamming distance mi22i, d, for a 2D scheme is the 
square of the minimum Hamming distance of the same detection scheme imple- 
mented in one-dimension (ID). (The Hamming distance tells how many errors 
can be detected, and/or corrected — one can detect d—1 errors.) For our particu- 
lar detection code, a parity check code, the minimum Hamming distance is 2 for 
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the ID case, but in 2D this becomes 4. Once again, this is not the whole story, 
because there are situations in the ID parity check scheme where more than one 
error can be detected, if the word is long enough; parity in ID can detect an odd 
number of bit flips: 1, 3, 5, etc; however, even parity flips cannot be detected^ 
To test our error detection scheme, we simulated random bit strings, with 
errors, and found that key material with bit errors as low as a few tenths of a 
percent had to be processed with the detection protocol at least twice to reduce 
errors to negligible levels. For random bit strings with high BERs (BERs of more 
than a couple of percent), small 2D matrices were needed on the first pass, but 
with each subsequent detection pass a larger 2D matrix could be used. We found 
that bit strings with BERs as high as 10% could be reduced to an estimated ~ 1 
bit-error in a total of 10® bits after 4 passes, with ~ 14% of the initial key 
remaining; the sizes of the matrices in the 4 passes were 6 by 6, 7 by 7, 13 
by 13, and 13 by 13, respectively^ (We never operated our system with BERs 
this high, but in our simulations we wanted to determine the detection scheme’s 
capabilities. We also found that there exists an optimal matrix size which most 
efficiently reduced errors while preserving a maximal amount of key material. 
The sizes varied from a 6 by 6 to a 12 by 12, almost linearly, for BERs between 
10% and 1%.) 

4 Eavesdropping: An Attack by Eve 

Much has been said about the security of QKD against attack by an eavesdropper 
m- There are essentially two types of attack to consider: opaque attacks and 
translucent attacks. 

4.1 Opaque Attack 

In an opaque attack. Eve intercepts all collectable bits, or single photons, by 
positioning herself between Alice and Bob. If Eve possesses a transmitter and 
receiver identical in every way to Bob’s receiver and Alice’s transmitter, and 
Bob, Alice and Eve are operating under the B92 protocol, then Eve can deter- 
mine as much information about the key as could Bob. For example, if Alice’s 
transmission basis is \h) and |r), and Eve’s measurement basis is \tj and 
then Eve can know Alice’s transmitted bits with a maximum efficiency of 25% o 
If Eve retransmits the bits she “knows,” then she will lower Bob’s expected bit- 
rate, relative to Alice, by at least a factor of 4, but she will be forwarding bits 
of the correct value to Bob. 

^ We only detect and eliminate errors, and do not attempt to correct them. 

® We used square matrices of an odd size (7 by 7 and 13 by 13) in our simulations, 
but decided against using them in our experiments when we determined that the 
elimination of the diagonals of the matrices of odd size was insufficient to ensure 
security. 

In a real system. Eve will experience reception losses associated with the collection, 
fiber launch and detection of the single photons. 
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If Eve can collect, measure, and quickly retransmit the bits she detects, she 
can then listen to Alice’s and Bob’s open bit reconciliation protocol (see Sec. 
E3)- And, while Bob never reveals his bits values. Eve still knows what bits 
Alice and Bob commonly share because she knows when Alice began transmitted 
random bits. At this point, if Alice and Bob know their system well. Eve has 
been revealed by the additional factor of 4 attenuation, e.g.. Eve has discarded 
a minimum of 75% of her bits, but Alice has discarded a minimum of 93.75% of 
her bits, i.e.. Eve discards (1 — 1/4), but Alice discards (1 — 1/16). 

Some could argue this additional attenuation to Bob’s and Alice’s common 
key is protection enough against an opaque attack, but our implementation of 
B92 adds another layer of protection if Eve attempts to bring Bob’s bit rate 
to a rate indistinguishable from her own. Eve can do this by retransmitting a 
bright classical pulse to Bob for each single photon she detects^ However, our 
system protects against this attack when operated in either a 2, 3, or 4 SPCM 
mode. In a 2 SPCM system, this type of attack would be revealed through 
an increase in “dual-fire” errors. Dual-fire errors occur when both SPCMs fire 
simultaneously. (In a perfect system there would be no dual- fire errors, regardless 
of the average photon number per pulse. However, in an imperfect experimental 
system dual-fire errors will occur, because there will be bit-errors associated with 
the transmission and measurement protocols, i.e., impure bit preparation and 
measurement associated with optical alignment of the transmission, receiving, 
and analysis optics.) 

If we consider only a perfect system, then no matter how many horizontally 
polarized photons travel the |r) analysis path, none will reach the |r) analyzing 
detector. However, if this analysis path includes an effective half-wave retarder 
followed by a PBS, then the half wave-retarder will convert right-circular polar- 
ized photons to left-circular polarized photons which will then be equally split 
equally between the two output paths. If both paths are each followed by an 
SPCM then both SPCMs will fire. 

The component of a right-circular polarized pulse that travels the | h) analysis 
path encounters the effective quarter-wave retarder followed by another PBS. 
The quarter-wave retarder converts this right-circular polarized ‘bright’ pulse to 
a vertical-polarized ‘bright’ pulse which is reflected along the path away from 
the \h) analyzing detector. If this path contains an SPCM, then this SPCM will 
fire together with the two SPCMs which terminate on the |r) analyzing path. 
Thus, 3 of 4 detectors have fired alerting Bob and Alice that Eve is opaquely 
attacking the key. A similar argument applies if Bob is using 3 detectors. 



4.2 Translucent Attack 

Eve could also passively, or translucently, attack the quantum transmission with 
a BS. In this scheme. Eve receives the binomial reflection probability of the BS 
she uses to reflect photons toward her receiving optics, and Bob receives the 

® In B92 it is possible to send bright classical pulses of the appropriate polarization to 
ensure that every bit transmitted is detected at the receiver. 
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binomial transmission probability of the BS Eve uses. In a translucent attack 
it is necessary to consider Eve’s and Bob’s reception and detection efficiencies, 
which are independent; Eq. 0 shows the amount of information on the key Eve 
receives as a function of the reflection coefficient, Re = 1 — Te, of BS she uses 
in her translucent attack, her receiver efficiency tje, and her detector efficiency, 
Tj^. Equation shows the amount of key Bob receives as a function of the 
transmission coefficient, Te, of the BS Eve uses in a translucent attack, and his 
reception and detection efficiencies, r]B and 

Ue = Ua [1 - (1 - , (5) 

n— 1 

and 

TZb = Pa f^P: [1 - (1 - , ( 6 ) 

n— 1 

The 1/4 reduction of these products is as previously described in Sec |3 Eve’s 
bit rate is TZe, and TZb is Bob’s, and TZa is the rate Alice is transmitting. 

The privacy P, or the percentage of information Eve possesses on Alice’s and 
Bob’s common key, is determined as the ratio of the number of bits Eve and Bob 
share (observe coincidently) to the number of bits Alice and Bob share. First of 
all, if there is only 1 photon in a pulse, then either Eve or Bob will receive it, 
but not both. Based on this premise, Eq. 0 shows the number of bits that Bob 
and Eve will share if Eve attacks the key with a BS of transmission coefficient 
Te, and reflection coefficient = 1 — Tg. 



OO n-l B E 

Nbae = [l-(l-^n[l-(l-^)"-™] (7) 

n—2 m—1 

Equation El shows Alice’s and Bob’s privacy, P. 

OO n— 1 

E OT’W™!! - (1 - ^)’"][i - (1 - 



p = 



n—2 m—1 









(8) 



n—1 



Under this type of translucent attack, if Eve uses 50/50 BS, and if Alice 
transmits coherent Poisson pulses with an average of 0.1 photon per pulse, and 
if Bob’s and Eve’s system and detection efficiencies are equal, then for every 250 
bits Eve and Bob acquire. Eve will commonly share ~ 3 of her 250 bits with 
Bob’s 250 bits, or ~ 3/250 of Alice and Bob’s common key. In fact, because Eve’s 
knowledge on Alice’s and Bob’s common key is coupled to hers and Bob’s system 
efficiencies, this situation represents the maximum amount of information Eve 
can obtain on Alice’s and Bob’s common key even if her system is perfectly 
efficient and Bob’s is not. The inverse is also true, i.e., if Bob’s system is more 
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efficient than Eve’s, then the amount of information Eve can determine on Alice’s 
and Bob’s common key decreases. (Note: at this point, we have only been able 
to show this empirically, but we have found no exceptions to these facts). Eve 
could determine which bits she commonly shares with Bob when Alice and Bob 
reconcile their common bits. 

Finally we note that because Alice transmits coherent states, as opposed to 
single photon Fock states, she and Bob also need to add a stage of “privacy am- 
plification l2ilH ” to reduce any partial knowledge gained by an eavesdropper to 
less than 1-bit of information. We have not implemented such a privacy ampli- 
fication protocol at this time, but our free-space QKD system does incorporate 
“one time pad 1241 ” encryption — also known as the Vernam Cipher: the only 
provably secure encryption method — and could also support any other symmet- 
ric key system. 

5 Conclusions 

The results in this paper demonstrate free-space QKD through a turbulent 
medium under nighttime conditions. We have described a system that provides 
two parties a secure method to secretly communicate with a simple system based 
on the B92 protocol. We presented two attacks on this protocol and demon- 
strated the protocol’s built in protections against them. This system was op- 
erated at a variety of average photon number per pulse down to an average 
of < 0.1 photon per pulse. The results were achieved with low BERs, and the 
240-m experiment demonstrated that BERs of 0.7% or less are achievable with 
this system. This protocol could be implemented with classical signature au- 
thentication | 2 ] and privacy amplification procedures to ensure the security of 
private information. From these results we believe that it will be feasible to use 
free-space QKD for re-keying satellites in low-earth orbit from a ground station. 
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Abstract. The various eavesdropping strategies on a quantum crypto- 
graphic channel as discussed by Gisin et.al. can be reproduced by unsharp 
spin (represented by positive operator valued measure) measurement of 
spin-1/2 particle. An upper bound for unsharp parameter is obtained so 
that it still gives rise to violation of Bell’s inequality for the two particle 
transmission channel. 



Recently Gisin and Huttner [1] analysed various eavesdropping strategies on a 
quantum cryptographic channel. In their treatment with a 2-D probe, they have 
taken a particular evolution and have shown that for spin-1/2 particle, Eve’s 
eavesdropping simply shrink the Block vector of the Alice’s state by a factor rj 
which is a function of measurement of intensity 7 (0 < 7 < tt/ 2). Then they 
have applied the same kind of interaction in the case of cryptographic protcol 
with two spin- 1/2 particles in a singlet state and have calculated the parameter 
S that appears in the GHSH version of Bell’s inequality as a function of the same 
7 - 

In this paper, we will reproduce these particular results applying unsharp spin 
measurement formalism [2] which represents Eve’s interference with the particle 
reaching to Bob. 

For this purpose let us shortly describe the unsharp measurement of spin prop- 
erty. In unsharp formalism the spin property is represented by POV measure 
which is more general than projection operator. We write the POV measure 
representing the unsharp property [2,3] 

Ex{n) = ^[I + Xn.a] (1) 

where n is a unit vector and A is real with 0 < A < 1. For A = 1, it becomes 
sharp spin property E(n). 
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E\{n) can be written as 

Ex{n) = + ^-^E{-n) (2) 

where 

E{-n) = {I - E{n)) 



= r > 1/2 is called degree of reality and = u < 1/2 is called degree of 
unsharpness for the spin property. 



Now every measurement corresponds to to an operation [4] which gives the 
final state of the system. Here we consider the generalised Luder operation [4,5] 
which disturbs the initial state minimally. The Luder generalised operation t/i 
corresponding to the measurement E\{n) on a state represented by the density 
operator p = + a.a] is given by 

(j)LP= {Ex{n)Y/'^p{Ex{n)Y/‘^ 



= ]^Tr[pEx{n)] + ^{(1 - - n(n.a)] + n[A + {a.n)]}.a 

Let the state prepared by Alice is given by 

p Alice = + rn. a] 



(3) 

(4) 



where m is an unit vector known as Bloch vector representing pAUce on the 
Poincare sphere. 



Alice sends this state to Bob. But before reaching to Bob, Eve makes some 
unsharp measurement on this state. So the changed state due to the measurement 
disturbance (without reading the result) which reaches to Bob is given by 

PBob = {Ex{n)Y/^pAUce{Ex{n)f/^ + (/ - Ex{n)f'^ PAUce{I - Ex{n)f/^ (5) 

Applying ® we get 

PBob = + ^["1 + (1 - A^)^/^{(n.m)n - m}].a (6) 

But from such state reaching to Bob, Alice and Bob can easily infer that the noise 
in that the transmission is not produced by random process and hence get some 
information about Eve’s strategy. For avoiding that Eve must perform two spin 
measurements, one along direction n and another along n (along perpendicular 
direction to n) randomly. 

From Ex{n) measrement, the resulting state will be given by 
PBob A^)^/^{(h.m)h - m}].a 



( 7 ) 
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So the final density matrix for the random combination of both the strategies 
can be written as 

pBob = -^PBob + -^PBob ~ 2^^ 2^^ + (1 ~ A^)^/^}m.cr] (8) 

which is same as the density matrix in equation (10) of reference 1 with 

(1 — = cosy 

The effect of Eve’s eavesdropping is thus simply to shrink the Bloch vector by 
the factor 

1{1+(1_A2)V2} 

which is a function of the unsharp parameter A. 

Let us now turn to the strategy of cryptography with the EPR pair one with 
Alice and the other reaching to Bob. On the later Eve makes some unsharp 
spin measurement. We want to see how much unsharpness is allowed for spin 
measurement so that Alice and Bob still get the violation of Bell’s inequality. 

We write the singlet state as 

V’O = -^ [ipn O i’-n - O i’l] (9) 

where particle 1 reaches to Alice and 2 reaches to Bob and 'ip± are eigen states 
of (Tz- The state after Eve’s unsharp spin measurement on the particle 2 will be 
given by 

W = / O (EA(n))i/2p[V^o]/ O (Exin))^^^ 

+ 1 0 (/ - Ex{n)y^^P[MI O (7 - Ex{n)y/^ 

1 1 (10) 

>< i’-nl O \i’-n >< V’il - C.C 

Here P[.] represents projection operator on the vector in the square bracket. 

Now the expectation value of the Bell operator (for the choice of spin observables 
giving maximal violation in a singlet state) for the state W is 

< BbsU >w= V2[1 + (1 — A^)^/^] (11) 

Again Bell’s inequality satisfies 



2 < < Bsell > <2 
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So for violation of Bell’s inequality we need 

A < \/2(^/2- 1)1/2 

In conclusion, it is to be cleared that we completly agree to the result obtained 
by Gisin et.al. for various Eavesdropping strategies. We only showed that their 
particular measurement scheme for producing the results as discussed above, 
is equivalent to unsharp spin measurement with generalised Luder operation. 
In cryptography, this equivalency is important because sometimes the unsharp 
measurement produce the same information as its sharp counterpart but disturbs 
the initial state minimally. 
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Abstract. An information-theoretic approach to quantum copying is 
discussed, relying on the notion of quantum loss, a quantity that reflects 
the transmission quality in a noisy quantum channel. More specifically, 
an entropic no-cloning inequality is derived for a Hilbert space of arbi- 
trary dimension, which describes the tradeoff between the losses of the 
channels leading to the two copies. Then, focusing on quantum bits, a 
family of Pauli cloning machines is introduced. These machines produce 
two imperfect copies of a single quantum bit that emerge from two dis- 
tinct Pauli channels. The balance between the quality of the two copies is 
shown to result from a genuine complementarity principle. In the special 
case where the two outputs are associated with depolarizing channels of 
probability p and p' , the domain in [y/p, v^)-space located inside a par- 
ticular ellipse representing close-to-perfect cloning is forbidden. Finally, 
the class of symmetric Pauli cloning machines is used to provide an upper 
bound on the quantum capacity of the Pauli channel of probabilities Px, 
Py and Pz- The capacity is proven to be vanishing if (y'pT, y/Wz) lies 
outside an ellipsoid whose pole coincides with the depolarizing channel 
that underlies the universal cloning machine. 



1 Introduction 

A remarkable property of quantum information is that it cannot be copied, in 
contrast with information we are used to in classical physics. This means that 
there exists no physical process that can produce perfect copies of a system that 
is initially in an unknown quantum state. This so-called no-cloning theorem, 
recognized by Dieks Q and Wootters and Zurek j5|, is an immediate consequence 
of the linearity of quantum mechanics, and lies at the heart of quantum theory. 
Indeed, if perfect cloning was permitted, the Heisenberg uncertainty principle 
could be violated by measuring conjugate observables on many copies of a single 
quantum system. 

* This paper was presented at the 1st NASA International Conference on Quantum 
Computing and Quantum Communications, Palm Springs, February 1998. 
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Consider a cloning machine that duplicates a quantum bit (a 2-state system) 
that is initially in an arbitrary state \ip) = a|0) -I- /3|1) whose amplitudes are 
unknown. It is easy to build such a machine that perfectly copies the two basis 
states |0) and |1), but then it badly duplicates the superpositions 2“^/^(|0)±|l)). 
In other words, it cannot produce perfect copies of all possible input states. This 
being so, we may ask how well one can approximately duplicate the unknown 
state of a quantum bit (qubit) if the quality of the copies is required to be 
independent of the input state. This question was answered by Buzek and Hillery 
who first showed that it is possible to construct a cloning machine that yields 
two imperfect copies of a single qubit 0 . Specifically, a universal cloning machine 
(UCM) can be defined that creates two copies characterized each by the same 
density operator p, the fidelity of cloning being / = {'if\p\'if) =5/6. This machine 
is called universal because it produces copies that are state-independent, both 
output qubits emerge from a depolarizing channel of probability 1/4, that is, the 
Bloch vector characterizing the input qubit is shrunk by a factor 2/3 regardless 
its orientation. The UCM was later proved to be optimal by Bruss et al. Pj, 
and Gisin and Massar 0. Much attention has been recently directed towards 
quantum cloning machines, because of their use in connection with quantum 
communication and cryptography (see, e.g., OT). 

The outline of the paper is as follows. In Section 0 we start by reviewing 
the characterization of a noisy quantum channel by its loss, a quantity that re- 
flects the quality of the transmission. The loss, depending on the input and the 
operation performed by the channel, can be shown to vanish when the trans- 
mission of quantum information is perfect. This concept is used to display the 
information-theoretic significance of the quantum no-cloning theorem. More pre- 
cisely, an entropic no-cloning inequality is derived, characterizing the impossi- 
bility of copying imposed by quantum mechanics: La -\- Lb > 2S, where La and 
Lb are the losses characterizing outputs A and B, respectively, while S is the 
source entropy. In Section 0 we introduce a family of asymmetric Pauli cloning 
machines (PCM), which produces two distinct (approximate) copies of a single 
quantum bit, each emerging from a Pauli channel 0. This is in contrast with 
the cloning machines considered in the literature, which are symmetric (both 
outputs being characterized by the same density operator). The family of PCMs 
relies on a parametrization of 4-qubit wave functions for which all qubit pairs 
are in a mixture of Bell states. 

Using a particular class of asymmetric PCMs whose outputs emerge from 
(distinct) depolarizing channels, we derive a no-cloning uncertainty relation 
governing the tradeoff between the quality of the copies of a quantum bit: 

-\- ab -\- b'^ > 1, where of and 6^ are the depolarizing fractions of the chan- 
nels associated with outputs A and B, respectively. It is, by construction, a 
tight inequality which is saturated using our PCM. More generally, the com- 
plementarity between the two copies produced by a Pauli cloning machine is 
shown to result from an uncertainty principle, much like that associated with 
Fourier transforms. This uncertainty principle relates the probability distribu- 
tions underlying the channels leading to the two outputs of the doner. Finally, 
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the subclass of symmetric PCMs is used in order to express an upper bound on 
the quantum capacity of the Pauli channel. In particular, the capacity of the 
Pauli channel of probabilities Px = x'^ , Py = y'^ and p^ = z'^, is shown to vanish 
if {x, y, z) lies outside the ellipsoid x^ + y"^ + z"^ + xy + xz + yz = 1/2, whose 
pole coincides with the depolarizing channel underlying the UCM. This implies 
an upper bound on the capacity C of any Pauli channel associated with a point 
{x, y, z) located inside the ellipsoid, namely C < l — 2{x‘^ + y‘^ + z‘^ + xy+xz+yz). 

2 Information-Theoretic Significance of the Quantum 
No-cloning Theorem 

2.1 Entropic Characterization of a Noisy Quantum Channel 

Let us outline the entropic treatment of a noisy quantum channel that is intro- 
duced in Refs. [iSiq) . The description of a channel involves three quantum systems 
of arbitrary dimensions: X (the input quantum system whose processing by the 
channel is concerned), R (a reference system which X is initially entangled with), 
and E (an environment which X is interacting with in the noisy channel, induc- 
ing decoherence) ^0]. More specifically, we assume that X is initially entangled 
with R, so that the joint state of X and R is the pure state \^rx)- We may as 
well regard X as a quantum source, being initially in a mixed state px (realized 
by a given ensemble of quantum states associated with some probability distri- 
bution). The “purification” of px into \1^rx) can always achieved by extending 
the Hilbert space Tix to Hrx, so that we have px = T^^r{\'Erx){'Erx\)- The 
corresponding reduced von Neumann entropies are 

S{R) = S{X) = S (1) 

where S = — Trx(px log px) is called the source entropy and is a function of 
Px, the density operator characterizing the input X. In the dual picture where 
an arbitrary state of X (rather than entanglement) is sent through the channel, 
S then measures the “arbitrariness” of the input X (it can be viewed as the 
average number of quantum bits that must be processed by the channel in order 
to transmit the state of X). In our information-theoretic characterization of 
quantum channels, we prefer to consider an input X that is entangled with R, 
so that we investigate the transmission of entanglement rather than of arbitrary 
states. The initial quantum mutual entropy to be transmitted in a quantum 
channel is thus 

S{R-.X) = S{R) + S{X) - S{RX) = 2S (2) 

that is, twice the source entropy. 

When it is processed by the noisy quantum channel, X interacts with E (as- 
sumed to be initially in a pure state |0)) according to a particular unitary trans- 
formation U, inducing decoherence (see Fig. [Q. This depicts the most general 
(trace-preserving) operation of a quantum channel that is allowed by quantum 
mechanics. Roughly speaking, only a fraction of the initial entanglement with R 
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can be recovered in general after processing by the channel, while the rest of the 
entanglement with R is lost (i.e., it is transferred to the environment E). More 
specifically, the quantum output Y (i.e., the decohered quantum system after 
interaction with E) is characterized by the density operator 

py = TrB(t/(pjf ®|0)(0|)t/t) (3) 

The completely positive linear map px Py corresponds to the “quantum 
operation” performed by the noisy channel cni. 



Fig. 1. Noisy quantum channel of input X (initially entangled with a reference 
R) and output Y . Decoherence is induced by the interaction with an environment 
E, initially in the pure state |0). 
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The processing of quantum information through the channel X ^ Y can be 
characterized by two entropies, the quantum mutual entropy I and the quantum 
loss L: 



I = S{R:Y) (4) 

L = S{R:E) (5) 

where E denotes the environment after decoherence while R is the reference 
before or after decoherence (i? is not involved in decoherence) . It can be shown 
that / and L are independent of the choice of the reference system R provided 
that the latter purifies the input X. They depend in general on the channel 
input (i.e., px) and on the quantum operation performed by the channel (i.e., 
the completely positive trace-preserving map px Py that is specified by U in 
the joint space of X and E). The processing of quantum information in the noisy 
channel is characterized by the balance between I and L, these two quantities 
always summing to twice the source entropy: 

I + L = 2S (6) 

Physically, I represents the amount of the initial mutual entropy with respect 
to R (i.e., 25) that has been processed by the channel, while L corresponds 
to the fraction of it that has been unavoidably lost in the environment. If the 
channel is lossless (L = 0), then a perfect transmission of quantum information 
can be achieved by applying an appropriate decoding IMEi. In other words. 
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the interaction with the environment can be perfectly “undone” , and the initial 
entanglement with R can be fully recovered. (Equivalently, this means that an 
arbitrary initial state can be recovered without error.) Conversely, if / = 0, 
then no information (classical or quantum) can be processed by the channel. In 
between these limiting cases, the loss provides a measure of (or rather a bound 
on) how reliably quantum information can be transmitted |^. 



2.2 Entropic No-cloning Inequality 

Let us consider a cloning machine of input X and outputs A and B. In other 
words, A and B are the (approximate) copies of X produced by the cloning 
machine. The above entropic analysis can be used by associating the input-to- 
output overall operation characterizing each copy with a noisy quantum channel. 
For that purpose, we assume as before that the input of the doner, X, is initially 
entangled with a reference system R. The question will be to determine to what 
extent entanglement (with respect to R) can be transmitted simultaneously to 
both outputs, A and B. To be general, we also assume that the cloning machine, 
denoted as C, is itself involved in the copying process. Thus, the action of the 
doner can be described, in full generality, as a particular unitary operation acting 
on X together with two auxiliary systems (each being initially in a prescribed 
state |0)), yielding A, B, and C (see Fig. E)- 



Fig. 2. Quantum cloning machine of input X (initially entangled with a reference 
R) and outputs A and B. The symbol C refers to an ancilla or the cloning 
machine. 
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Thus, the first output A emerges from the channel X ^ A, which can be 
characterized by the loss 

La = S{R:BC) (7) 

since B and C play the role of an environment for this channel. Similarly, the 
second output B emerges from the channel X ^ B, which is associated with a 
loss 



Lb = S{R:AC) 



( 8 ) 
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If a successful cloning was achieved, one expects that La = Lb = 0, that is, 
both channels would transmit quantum information perfectly^ However, using 
the chain rule for quantum mutual entropies, it is easy to show that 

La = S{R:BC) = S{R:B) + S{R:C\B) > S{R:B) (9) 

where we have used the property of strong subadditivity of quantum entropies, 
namely S{R\C\B) > 0. From Eqs. (I4lti|) . the mutual entropy of the channel 
X ^ B can be written as S{R:B) =2S — Lb, where S = S{X) is the quantum 
source entropy. As a consequence, we obtain the entropic no-cloning inequality 

La + Lb>2S (10) 

which implies that the losses La and Lb characterizing the two copies cannot 
vanish simultaneously for S > 0 (i.e., when attempting to clone a state that is 
not fully known). This inequality reflects the impossibility of perfectly copying 
quantum information and quantifies the balance between the quality of the copies 
in terms of entropies. It is valid for the cloning of a quantum state in a Hilbert 
space of arbitrary dimensions. Unfortunately, this entropic no-cloning inequality 
is not tight. Indeed, by applying the UCM to an arbitrary qubit {S = 1), it can 
be shown that La = Lb = 2 — log2(3)/2 = 1.21 bits, so that La + Lb = 2.42 bits 
does not saturate the bound in Eq. m- In the next Section, we show how to 
derive a tight inequality for the special case of quantum bits. 



3 Pauli Cloning Machines for Quantum Bits 



3.1 Characterization of a Pauli Channel Using the Bell States 



Consider a quantum bit in an arbitrary state {ip) which is processed by a Pauli 
channel. Thus, the qubit is rotated by one of the three Pauli matrices or remains 
unchanged: it undergoes a phase-flip (ctz), a bit-flip (ctx), or their combination 
(cajCTz = —ioy) with respective probabilities Pz, Px, and Py. (A depolarizing 
channel corresponds to the special case where Px = Py = Pz-) It is convenient 
to describe the operation of such a channel by considering an input maximally 
entangled with a reference system. Defining the four maximally-entangled states 
of two qubits (i.e., the Bell states) as 



|<?±) = ^(| 00 )±| 11 )) 



|^^±) = ^(| 01 )±| 10 )) 



( 11 ) 



we note that the local action of the Pauli matrices on one of these states, say 
l^'*'), yields the three remaining Bell states, nameljfl 



(I0az)|^>+) = \<P~) 

(l(g)a^)|^>+) = |iF+) 

(l®a,az)|^>+) = If-) (12) 

^ This is a minimal requirement. One could also require the outputs A and B to be 
independent, for example. 

^ We use here the convention |0) = | |) and |1) = | J.). 
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Therefore, if the input qubit X of the Pauli channel is maximally entangled with 
a reference qubit R, say if their joint state \'4>)rx is the Bell state 1^“*'), then the 
joint state of R and the output F is a mixture of the four Bell states 

PRY = {l-p) +Pz \<^~){^~\ +Px |'Z'+)('f+| +Py \^-){^-\ , (13) 

with p = p^+py +p^. 

A simple correspondence rule can then be written relating an arbitrary mix- 
ture of Bell state and the associated operation on a qubit |^) by a Pauli channel. 
Start from the mixture 

3 

PRY = {l-p) (14) 

i=l 

where pi < P 2 ^ Pa, P = Pi + P 2 + Pa, and |<^i) stand for the three remaining 
Bell states ranked by increasing weight. It is straightforward to show that the 
operation on an arbitrary state lip) performed by the corresponding channel is 

li’) ^ P = (1 -P-P2) |V’)(V'I + {P2-Pl) O'l|'0_L)(V^_L|cri 

+ {pa -P 2 ) craWiP^Wa + 2(pi +P 2 ) 1/2 (15) 

where \ip±) = —iay\ip*) = axCizlip*) denotes the time-reversed of state \ip). The 
four components in the right-hand side of Eq. USD correspond respectively to 
the unchanged, (rotated) time-reversed, rotated, and random fraction. It is clear 
from Eq. JED that the operation of the channel is state-independent only if 
Pi = P 2 = Pa = p/3, that is, if the time-reversed and rotated fractions vanish. 
Then, we have a depolarizing channel of probability p, i. e., pry is a Werner 
state and Eq. (na) becomes 

IV') ^ P = (1 - 4p/3) \ip){'ip\ + (4p/3) 1/2 (16) 

Thus, the vector characterizing the input qubit in the Bloch sphere is shrunk 
by a scaling factor s = 1 — 4p/3 regardless its orientation, so that the fidelity of 
the channel, / = {'ip\p\'tp) = 1 — 2p/3 = (1 -I- s)/2, is independent of the input 
state. Other channels are necessarily state- dependent. For example, the “2-Pauli” 
channel of probability p (i.e., Px = Pz = p/2 and Py = 0) performs the operation 

IV') ^ P = (1 - 3p/2) \ip){'tp\ -k (p/2) ay\ip±){ip±\ay-i-pl/2 

= (l-3p/2) \iP){iP\ + {p/2) \r)ir\+pi/^ (17) 

while the dephasing channel of probability p (i.e., Pz = p and Px = Py = 0) 
simply gives 

\ip) ^ {l-p)\'ip){ip\+paz\'ip){'ip\<7z (18) 

3.2 Asymmetric Pauli Cloning Machines 

We define an asymmetric Pauli cloning machine as a machine whose two out- 
puts, A and B, emerge from distinct Pauli channels 0 . Thus, if the input X of 
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the doner is fully entangled with a reference R, i.e., the density 

operators pjiA and prb niust then be mixtures of Bell states. Focusing on the 
first output A, we see that a 4-dimensional Hilbert space is necessary in general 
to purify pjiA since we need to accommodate its four (generally nonzero) eigen- 
values. The 2-dimensional space of second output qubit B is thus insufficient for 
this purpose, so that we must introduce an additional system C, which may be 
viewed as an ancilla or the cloning machine itself. A 2-dimensional space for C 
is then sufficient, so that we need to consider a single additional qubit C for the 
cloning machine, as shown in Refs. m- As a consequence, we are led to consider 
a 4-qubit system in order to fully describe the PCM, as pictured in Fig. [3 Before 
cloning, the qubits R and X are in the entangled state |^^), the two auxiliary 
qubits being in a prescribed state, e.g., |0). After cloning, the four qubits R, A, 
B, and C are in a pure state for which pnA and prb are mixtures of Bell states 
{A and B emerge from a Pauli channel). As we shall see, pac happens to be also 
a mixture of Bell states, so that C can be viewed as a third output emerging 
from a Pauli channel. 



Fig. 3. Pauli cloning machine of input X (initially entangled with a reference 
R) and outputs A and B. The third output C refers to an ancilla or the cloning 
machine. The three outputs emerge in general from distinct Pauli channels. 




Instead of specifying a PCM by a particular unitary operation acting on the 
state |'0) of the input qubit X (together with the two auxiliary qubits in a fixed 
state |0)), it is more convenient to characterize it by the wave function \^) babc 
underlying the entanglement of the three outputs with R. So, our goal is to find 
in general the 4-qubit wave functions that satisfy the requirement that the state 
of every pair of two qubits is a mixture of the four Bell states. Making use of the 
Schmidt decomposition of \'1/)rabc for the bipartite partition RA vs BC, it is 
clear that this state can be written as a superposition of double Bell states 

mRA-BC = {v \^+)\^+) + z \^-)\<P~) + X |!F+)|^+) + y \'I'-)\^-)}j,A BC ’ 

(19) 

where x, y, z, and v are complex amplitudes (with |a:p-|-|j/p-|-|zp-|-|z;p = 1). Note 
that the possible permutations of the Bell states in Eq. m are not considered 
here for simplicity. The above requirement is then satisfied for the qubit pairs 
RA and BC, that is, pra = Pbc is of the form of Eq. (tm with = \x\^, 
Py = lyp, pz = \z\^, and 1— p = |r;p. It is important to note that these double 
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Bell states for the partition RA vs BC transform into superpositions of double 
Bell states for the two other possible partitions of the four qubits RABC into 
two pairs {RB vs AC, RC vs AB). For example, the transformation associated 
with the partition RB vs AC is 

|^+)bC = i + |<?-)|<?-) + 

\<1>-)ra |<?-)bc = ^ + |<?-)|<?-) - - \^-)W~)}nB-,AC 

W+)ra W+)bc = i {!<?+)!<?+) - |<?-)|<?-) + - \^-)W~)}nB-,AC 

\^-)ra \'I'-)bc = I - |<?-)|<?-) - + \'1'-)W~)}rb ac 

( 20 ) 

(For the partition RC vs AB, these expressions are similar up to an overall sign 
in the transformation of the state \'R~)ra \'1'~)bc-) This implies that \'P)rabc 
is also a superposition of double Bell states (albeit with different amplitudes) 
for these two other partitions, which, therefore, also yield mixtures of Bell states 
when tracing over half of the system. Specifically, for the partition RB vs AC, 
we obtain 

mRB;AC = W 1 '^+)!'^+) + 2 /' 

( 21 ) 

with 

v' = {v + z + X + y) /2 
z' = {v + z — X — y) j2 
x' = {v — z + X — y) /2 

y' = {v - z- x + y)/2 (22) 

implying that the second output B emerges from a Pauli channel with proba- 
bilities = \x'\^, p’y = |y'P, and p(, = |z'p. Similarly, the third output C is 
described by considering the partition RC vs AB, 

W)rc-,ab = {v" |^+)|<?+) + z" + x" W+)W+) + y” W-)W~)] rc-,ab 

(23) 

with 

v" = {v + z + X — y)/2 
z" = {v + z — X + y) j2 
x" = {v — z + X + y)/2 

y" = {v-z-x-y)l2 (24) 

Thus, Eqs. (12211 and (I24II relate the amplitudes of the double Bell states for the 
three possible partitions of the four qubits into two pairs, and thereby specify 
the entire set of asymmetric Pauli cloning machines. 
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3.3 No-cloning Uncertainty Relation for Quantum Bits 

The complementarity between the two copies produced by a PCM can be shown 
to result in general from an uncertainty principle, much like that associated with 
Fourier transforms. In order to show this, let us construct the two-dimensional 
discrete function am,n with m,n = 0,1: let ao,o = v, ao,i = z, oi^o = x, and 
oi.i = y. Thus, output A emerges from a Pauli channel characterized by the 
probability distribution where p^ = |ao.iP, Px = |«i.oP, Py = 

and |ao,oP is simply the probability that the qubit remains unchanged. Similarly, 
output B can be characterized by a two-dimensional function hm,n defined as 
bo.o = v', &o,i = z! , 6i,o = x' , and = y' . Using this notation, it appears that 
Eq. iU is simply a two-dimensional discrete Fourier transform. 



This emphasizes that if output A is close to perfect (om,n is a peaked func- 
tion) then output B is very noisy {hm,n is a flat function), and conversely. Con- 
sequently, the probability distributions |am,nP and |6m.nP characterizing the 
channels underlying the two outputs A and B cannot have a variance simultane- 
ously tending to zero, giving rise to an uncertainty principle which governs the 
tradeoff between the quality of the copies. 

To illustrate this no-cloning uncertainty principle, let us consider the class of 
asymmetric PCMs whose outputs A and B emerge from (distinct) depolarizing 
channels. Assume that the first output A emerges from a depolarizing channel 
of probability p = 3|xp, i.e., 

PRA = \v\^\<P+){<P+\ + |xp {\<p-){<p-\ + |'F+)('F+| -b \<p-){<Ir-\) , (26) 

with |up + 3|xp = 1. Then, from Eq. we have u' = (u -b 3x)/2 and x' = 
{v — x)/2, resulting in 



Thus, the second output B also emerges from a depolarizing channel of probabil- 
ity p' = 3|x'p = ||u — x\^, implying that both outputs of this asymmetric PCM 
are state-independent and simply correspond to a different scaling of the vector 
characterizing the input qubit in the Bloch sphere. (The third output C emerges 
in general from a different Pauli channel.) The relation between the parameters 
X and x' characterizing the two outputs can be written as 




(25) 






(27) 



\x\^ + Re(a::*a;') -b \x'\^ = ^ 



(28) 



Clearly, the best cloning (minimum values for |x| and |x'|) is achieved when 
the cross term is the largest in magnitude, that is, when x and x' have the 
same phase. For simplicity, we assume here that x and x' are real and positive. 
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Consequently, the tradeoff between the quality of the two copies can be described 
by the no-cloning uncertainty relation 

+ xx' + i , (29) 

where the copying error is measured by the probability of the depolarizing chan- 
nel underlying each output, i.e., p = 3x^ and p' = 3x'^ (with x,x' > 0). Equa- 
tion (P|) corresponds to the domain in the {x, a;')-space located outside an ellipse 
whose semiminor axis, oriented in the direction (1, 1), is l/\/6, as shown in Fig.0 
(The semimajor axis is l/-\/2-) The origin in this space corresponds to a (nonex- 
isting) doner whose two outputs would be perfect p = p' = 0, while to distance 
to origin measures {p-\-p')/3. The ellipse characterizes the ensemble of values for 
p and p' that can actually be achieved with a PCM. It intercepts its minor axis 
at (1/-\/T2, l/V^), which corresponds to the universal cloning machine (UCM), 
i.e., p = p' = 1/4, as discussed below. This point is the closest to the origin (i.e., 
the doner with minimum p-|-p'), and characterizes in this sense the best possible 
copying. The UCM is the only symmetric doner belonging to the class of PCM 
considered here (i.e., doners whose outputs are depolarizing channels); other 
symmetric doners will be considered in Sec. 1 , 8.41 The ellipse crosses the x-axis 
at (1/2,0), which describes the situation where the first output emerges from a 
100%-depolarizing channel (p = 3/4) while the second emerges from a perfect 
channel (p' = 0). Of course, (0, 1/2) corresponds to the symmetric situation. 



Fig. 4. Ellipse delimiting the best quality of the two outputs of an asymmetric 
PCM that can be achieved simultaneously (only the quadrant x,x' > 0 is of 
interest here). The outputs emerge from depolarizing channels of probability 
p = 3x^ and p' = 3x'^. Any close-to-perfect cloning characterized by a point 
inside the ellipse is forbidden. 




The dimensional argument used in Sec. 1^21 strongly suggests that the imper- 
fect cloning achieved by an asymmetric PCM as described above is optimal: a 
single additional qubit C for the doner is sufficient to perform the best cloning, 
i.e., to achieve the minimum p and p' for a fixed ratio p/p' ■ (This is proven rigor- 
ously for the special case p = p' in Ref. 0). Also, introducing a phase difference 
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between x and x' results in a set of PCMs characterized by an ellipse that is 
less eccentric and tends to a circle of radius 1/2 for a phase difference of tt/2. 
Consequently, the no-cloning inequality is saturated when x and x' have 
the same (or opposite) phase. The domain inside the ellipse corresponds then 
to the values for p and p' that cannot be achieved simultaneously, reflecting the 
impossibility of close-to-perfect cloning, and Eq. is the tightest no-cloning 
bound that can be written for a qubit. 

The no-cloning uncertainty relation for qubits can also be reexpressed as 

+ ab+b^ > I , (30) 

where = 4p/3 = 4x^ and = 4p'/3 = 4x'^ denote the depolarizing fraction 
of the channels leading to outputs A and B, respectively^ The corresponding 
ellipse crosses the axes at (0, 1) and (1, 0), while the closest point to the origin is 
(y/l/3, Indeed, the outputs of the UCM emerge from two channels whose 

depolarizing fraction is 1/3. The uncertainty relation associated with the cloning 
of A^-dimensional quantum states (instead of qubits) is investigated in Ref. ^ . 
It is shown there that the cross-term in Eq. becomes 2ab/N, implying that 
the ellipse tends to a circle of radius one at the limit of a large dimension N. 



3.4 Symmetric Pauli Cloning Machines 



Consider now the class of symmetric PCMs that have both outputs emerging 
from a same Pauli channel, i.e., pra = Prb- Using Eq. (12 211 . we obtain the 
conditions 

|up = \v + z + x + yp/4 
\z\^ = \v + z — X — yp/4 
\x\^ = \v — z + X — yp/4 

\y\^^ = \y- z-x + y\’^/4 (31) 



which yields 



V = X + y + z , 



(32) 



where x, y, z, and v are assumed to be real. Equation together with the 
normalization condition, describes a two-dimensional surface in a space where 
each point {x, y, z)0 represents a Pauli channel of parameters Px = x'^, Py = y^, 
and pz = z"^. This surface. 



x^ + y"^ + z"^ + xy + xz + yz = — , 



(33) 



is an oblate ellipsoid E with symmetry axis along the direction (1, 1, 1), as shown 
in Fig. El The semiminor axis (or polar radius) is 1/2 while the semimajor axis 

^ The input qubit is replaced by a random qubit with probability (6^) or left un- 
changed with probability 1 — {1 — b^) in the channel leading to output A (B). 

We only consider here the first octant x,y,z > 0. 
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(or equatorial radius) is 1. In this representation, the distance to the origin is 
Px+Py+Pz, so that the pole (l/vT2, l/-\/T2, l/vT2) of this ellipsoid — the closest 
point to the origin — corresponds to the special case of a depolarizing channel of 
probability p = 1/4. Thus, this particular PCM reduces to the UCM. This 
simply illustrates that the requirement of having an optimal cloning (minimum 
Px + Py + Pz) implies that the doner is state-independent (p^: = Py = Pz)- 



Fig. 5. Oblate ellipsoid representing the class of symmetric PCMs whose two 
outputs emerge from the same Pauli channel of parameters Px = , Py = , 

and Pz = (only the octant a:,?/, z > 0 is considered here). The pole of this 
ellipsoid corresponds to the UCM. The capacity of a Pauli channel that lies 
outside this ellipsoid must be vanishing. 



where the polar angle 9 measures the “distance” from a depolarizing channel 
(0 = 0 implies Px = Py = Pz), while the azimuthal angle </> characterizes the 
distribution among px, Py, and Pz- 

3.5 Universal Cloning Machine 

The optimal symmetric PCM (i.e., the UCM) can be obtained alternatively by 
requiring that the two outputs A and i? of a symmetric doner are maximally 
independent. Using Eqs. (124^1 and we obtain v” = x + z, z” = y + z, 

x" = X + y, and y" = 0. Therefore, we have 



PRC = PAB = \x + z\^ \<P+){<^"^\ + \y + z\^ ){<P \ + \x + y\'^ . (35) 



X 




z 



The parametric equations of ellipsoid E can be written as 




(34) 
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showing that the third output C emerges from a Pauli channel with vanishing py. 
Thus, we want to maximize the joint von Neumann entropy of the two outputs 
A and B, 



S{AB) = -Tr{p AB log pab) = H [\x + z\'^ ,\ y + z\'^ ,\x + y\'^] (36) 

with H[-] denoting the Shannon entropy. It is easy to see that the solution with 
x,y,z > 0 that maximizes S{AB) is x = y = z, that is, the Pauli channel 
underlying outputs A and B reduces to a depolarizing channel. Using Eq. (I;i3ll . 
we get X = y = z = l/-\/T2, so that the wave function underlying the UCM is 

\'1')ra-bc = \^~^)ra\^'^)bc 

+ + \^~)\'1'~)}ra-,bc ( 37 ) 

Consequently 

PRA = PRB = l ^ (|^-)(^'-| + + !'?-)('?- I) (38) 

confirming that A and B emerge both from a depolarizing channel with p = 1/4, 

IV')-^IV')(V’l + i(l/2) (39) 

so that the scaling factor iss=l — 4p/3 = 2/3 while the fidelity of cloning is 
/= l-2p/3 = 5/6 0. For the partition RC vs AB, we obtain 



mRC;AB = ( 4 «) 

implying that the 4-qubit wave function is symmetric under the interchange of 
A and B (or R and C) . It is easy to check that the unitary transformation which 
implements the UCM jS| is 



|0)x |00) ^ )J^\00)ab |0)c + )J^\'1'^)ab |1)c 

|l)x |00) ^ |1 )c + )J^\'I'^)ab |0)c (41) 

Indeed, using Eq. m, we have 

^ y/^(|00)AB |00)i{c + |11 )tb |ll)ijc) 

+ ij^W^)AB{\Ol)RC + |10)_Rc) 



( 42 ) 
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so that the initial state of X (maximally entangled with the reference R) is 
transformed into the wave function Eq. (HI. The latter implies 

PRC = PAB = l (|^+)(<?+| + \<p-){‘p-\ + |'Z'+)(«'+|) (43) 

showing that the joint entropy of the two outputs is maximum (remember that 
the singlet component must vanish). Thus, the third output of the UCM 
emerges from a 2-Pauli channel of probability 2/3. Using Eq. ill vll . we see that 
the corresponding operation on an arbitrary state \^p) is 

\^)^l\r){r\ + l{i/2) (44) 

as noted in Ref. H2|. 

3.6 Related Bound on the Capacity of the Pauli Channel 

An interesting application of the UCM is that it can be used to establish an 
upper bound on the quantum capacity C of the depolarizing channel, namely 
C = 0 at p = 1/4 Extending this result, we show here that the class of 
symmetric PCMs characterized by Eq. (1331) puts a limit on the quantum capacity 
of Pauli channels. Consider a PCM whose outputs emerge from a Pauli channel 
of probabilities Px, Py, and Pz- Applying an error-correcting scheme separately 
on each output of the cloning machine (obliviously of the other output) would 
lead to a violation of the no-cloning theorem if the capacity C{px,Py,Pz) was 
nonzero. Since C is a nonincreasing function of p^,, Py, and pz, for px,Py,Pz < 1/2 
(i.e., adding noise to a channel cannot increase its capacity), we conclude that 

C{px,Py,Pz) = 0 if (x, y,z) ^ E (45) 

that is, the quantum capacity is vanishing for any Pauli channel that lies outside 
the ellipsoid E. In particular, Eq. implies that the quantum capacity van- 
ishes for (i) a depolarizing channel with p = 1/4 {px = Py = Pz = 1/12) (ii) a 
“2-Pauli” channel with p = 1/3 {px = Pz = 1/6, Py = 0); and (iii) a dephasing 
channel with p = 1/2 (px = Py = 0, Pz = 1/2). Furthermore, using the fact that 
C cannot be superadditive for a convex combination of a perfect and a noisy 
channel an upper bound on C can be written using a linear interpolation 
between the perfect channel (0, 0, 0) and any Pauli channel lying on E: 

C < 1 — 2{x^ + y'^ + z'^ + xy + xz + yz) . (46) 

Note that another class of symmetric PCMs can be found by requiring = 
PRC, be., considering C as the second output and B as the cloning machine. 
This requirement implies v = x — y + z rather than Eq. (I32II , which gives rise to 
the reflection of E with respect to the x 2 ;-plane, i.e, y —y. It does not change 
the above bound on C because this class of PCMs has noisier outputs in the 
first octant x,y,z > 0. 
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3.7 Quantum Triplicators Based on the PCM 



Let us turn to the fully symmetric PCMs that have three outputs emerging from 
the same Pauli channel, i.e., pra = Prb = PrCi which corresponds to a family 
of (non-optimal) quantum triplicating machines. The requirement pra = Prc 
implies v = x — y + z, which, together with Eq. (E2J, yields the conditions 
(u = X -|- 2 :) A (y = 0). Incidentally, we notice that if all pairs are required to 
be in the same mixture of Bell states, this mixture cannot have a singlet \^P~) 
component. The outputs of the corresponding triplicators emerge therefore from 
a “2-Pauli” channel {py = 0), so that these triplicators are state-dependent, in 
contrast with the one considered in Ref. 0. (For describing a state-independent 
triplicator, a 6-qubit wave function should be used, that is, the doner should 
consist of 2 qubits.) These triplicators are represented by the intersection of E 
with the x 0 -plane, that is, the ellipse 



-X2=- 



(47) 



whose semiminor axis is l/-\/3 [oriented along the direction (1, 1)] and semimajor 
axis is 1. The intersection of this ellipse with its semiminor axis {x = z = l/'s/b) 
corresponds to the 4-qubit wave function 



ahcd 






(48) 



which is symmetric under the interchange of any two qubits and maximizes 
the 2-bit entropy (or minimizes the mutual entropy between any two outputs 
of the triplicator, making them maximally independent). Equation I14MI thus 
characterizes the best triplicator of this ensemble, whose three outputs emerge 
from a “2-Pauli” channel with p = 1/3 {px = Pz = 1/6). According to Eq. ([771 . 
the (state-dependent) operation of this triplicator on an arbitrary qubit can be 
written as 

If IV') is real, Eq. (ITTII) reduces to the triplicator that was considered in Ref. |T7j . 
The fidelity of cloning is then the same as for the UCM, i.e., / = 5/6, regardless 
the input state (provided it is real). 



4 Conclusion 

We have derived an entropic no-cloning inequality, characterizing the balance be- 
tween the losses of the quantum channels leading to the two outputs of a cloning 
machine. This inequality is valid for states in a Hilbert space of arbitrary dimen- 
sion and implies that the losses of the two channels cannot vanish simultaneously 
as long as the source entropy is non-zero. Then, we have focused on the cloning 
of quantum bits and defined a family of Pauli cloning machines (PCM) . These 
doners, whose outputs emerge from two distinct Pauli channels, generalize the 
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universal cloning machine (UCM). The UCM is a special case (symmetric and 
state-independent) of a PCM. The asymmetric PCMs allowed us to derive a tight 
no-cloning uncertainty relation for quantum bits, characterizing the impossibility 
of (perfectly) copying due to quantum mechanics. Furthermore, we have shown 
that the tradeoff between the quality of the two outputs of a PCM results in 
general from an uncertainty principle akin to the complementarity between po- 
sition and momentum. Specifically, the probability distributions characterizing 
the channels associated with the two outputs cannot be peaked simultaneously, 
this complementarity being simply governed by the relationship between a func- 
tion and its Fourier transform. Finally, using a class of symmetric PCMs, we 
have established an upper bound on the quantum capacity of the Pauli channel. 
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ported in part by the NSF under Grant Nos. PHY 94-12818 and PHY 94-20470, 
and by a grant from DARPA/ARO through the QUIC Program (#DAAH04- 
96-1-3086). N.J.C. is Collaborateur scientifique of the Belgian National Fund for 
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Abstract. We review our recent work on the universal (i.e. input state 
independent) optimal quantum copying (cloning) of qubits. We present 
unitary transformations which describe the optimal cloning of a qubit, 
and we present the corresponding quantum logic network. We also present 
a network for an optimal quantum copying “machine” (transformation) 
which produces N+1 identical copies from the original qubit. Here again 
the quality (fidelity) of the copies does not depend on the state of the 
original and is only a function of the number of copies, N. In addition, 
we present the machine which universally clones states of quantum ob- 
jects in arbitrary-dimensional Hilbert spaces. In particular, we discuss 
universal cloning of quantum registers. 

Keywords: quantum cloning, quantum logic networks, inseparability 

1 Introduction 

The most fundamental difference between classical and quantum information is 
that while classical information can be copied perfectly, quantum information 
cannot. In particular, it follows from the no-cloning theorem P (see also m) 
that one cannot create a perfect duplicate of an arbitrary qubit while preserving 
the state of the original qubit. In the quantum teleportation (see P and references 
therein), one can create a perfect copy of the original qubit but this will be at 
the expense of the eomplete destruction of information encoded in the original 
qubit. In contrast, the main goal of quantum copying is to produce a copy of the 
original qubit which is as close as possible to the original state while the output 
state of the original qubit is minimally disturbed. 

We have shown recently that if one is only interested in producing 
imperfect copies, then it is possible to make quantum clones of the original 
qubit. For example, the copy machine considered by Wootters and Zurek P in 
their proof of the no-cloning theorem produces two identical copies at its output, 
but the quality of these copies depends upon the input state. They are perfect 
for the basis vectors, which we denote as |0) and |1), but, because the copying 
process destroys the off-diagonal information of the input density matrix, they 
are poor for input states of the form (|1) + where g) is arbitrary. 

We have introduced |5iBq a different copying machine, the Universal Quantum 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 235-^21 1999- 
© Springer- Verlag Berlin Heidelberg 1999 
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Copying Machine (UQCM), which produces two identical copies whose quality 
is independent of the input state. In addition, its performance is, on average, 
better than that of the Wootters-Zurek machine, and the action of the machine 
simply scales the expectation values of relevant observables. This UQCM was 
shown to be optimal, in the sense that it maximizes the average fidelity between 
the input and output qubits, by Gisin and Massar ^ and by BruB et al. |^. 
Gisin and Massar have also been able to find copying transformations which 
produce N copies from M originals (where N > M) |^. In addition, we have 
proposed quantum logic networks for quantum copying (cloning) machines 1 71101 , 
and bounds have been placed on how good copies can be II 111 21 . 

In this paper we will firstly review our original ideas on the universal quantum 
copying of a single qubit (Section II). In Section III we will present a quantum 
network describing the UQCM. Secondly, in Section IV we will introduce the 
copying machine which produces N + 1 identical copies from the original qubit 
and present a quantum network which implements it. The quality (fidelity) of 
the copies does not depend on the state of the original and is only a function of 
a number N of copies produced. This machine is formally described by the same 
unitary transformation recently introduced by Gisin and Massar jSj. In Section 

V we will analyze the properties of multiply cloned qubits. Thirdly, in Section 

VI we show how quantum registers (i.e. systems composed of many entangled 
qubits) can be universally cloned. One approach is to use single-qubit copiers 
to individually (locally) copy each qubit. We have shown earlier ^31 that in 
the case of two qubits this local copying will preserve some of the quantum 
correlations between qubits, but as we will show, it does not make a particularly 
good copy of the two-qubit state. As an alternative we propose a copy machine 
which universally clones quantum states in arbitrary-dimensional Hilbert spaces. 
This allow us to discuss the cloning of quantum registers. 

2 Universal Quantum Copying Machine 

Let us assume we want to copy an arbitrary pure state |•f')ao which in a particular 
basis {|0)ao, |l)ao} is described by the state vector |'f')ao 

\W)aa = a\0)ao + P\^)ao', o = sin ■d/2e*‘^; /3 = cosi?/2. (1) 

The two numbers which characterize the state m can be associated with the 
“amplitude” |a| and the “phase” (p of the qubit. Even though ideal copying, i.e., 
the transformation |'f')ao — ^ W)aoW)ax is prohibited by the laws of quantum 
mechanics for an arbitrary state it is still possible to design quantum copiers 
which operate reasonably well. In particular, the UQCM |S| is specified by the 
following conditions: 

(i)The state of the original system and its quantum copy at the output of the 
quantum copier, described by density operators and respectively, 

are identical, i.e., 

~{out) _ -{out) 

raQ rai 



( 2 ) 
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(ii) If no a priori information about the m-state of the original system is 
available, then it is reasonable to require that all pure states should be copied 
equally well. One way to implement this assumption is to design a quantum 
copier such that the distances between density operators of each system at the 
output where j = 0, 1) and the ideal density operator which describes 

the in-state of the original mode are input-state independent. Quantitatively this 
means that if we employ the Bures distance na 

dB{pi,P2) = V2 , (3) 

as a measure of distance between two operators, then the quantum copier should 
be such that 



rfB(pi7‘^;/5if)= const.; j = 0,l. (4) 

(iii) Finally, we would also like to require that the copies are as close as 
possible to the ideal output state, which is, of course, just the input state. This 
means that we want our quantum copying transformation to minimize the dis- 
tance between the output state pi°“*^ of the copied qubit and the ideal state 
Paf'^ ■ The distance is minimized with respect to all possible unitary transforma- 
tions U acting on the Hilbert space TL of two qubits and the quantum copying 
machine (i.e., Ti = Hao ® Hai ® Hx) 

ds(pi7‘);pif) = min{4'^)(p(7‘);pif (j = 0,l). (5) 

Originally, the UQCM was found by analyzing a transformation which contained 
two free parameters, and then determining them by demanding that condition 
(ii) be satisfied, and that the distance between the two-qubit output density 
matrix and the ideal two-qubit output be input-state independent. That the 
UQCM machine obeys the condition Q has only been shown recently |!Siqj . 

The unitary transformation which implements the UQCM is given by 

|0)aolO).-yf|00) 

0-00,1 IT). + y|l+) OqOi I 

|l)aolO).^ y||ll) OqOi li). + /|l+) OqOi I T)., (6) 

where |-l-)aoai = (|10)aoai + |01)aoai)/'\/2- This transformation satisfies the con- 
ditions 121bD . The system labelled by oq is the original (input) qubit, while the 
other system a± represents the qubit onto which the information is copied. This 
qubit is prepared initially in the state |0)ai (it plays the role that a blank piece of 
paper plays in a copier). States of the copy machine are labelled by x. The state 
space of the copy machine is two dimensional, and we assume that it is always 
in the same state \Q)x initially. If the original qubit is in the superposition state 
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m then the reduced density operators of both copies at the output are equal 
[see condition (|2)] and they can be expressed as 

piT^ = J = 0,1 (7) 



where \^±)aj = /3*|0)aj — o;*|l)aj , is the state orthogonal to This implies 

that the copy contains 5/6 of the state we want and 1/6 of the one we do not. 

The density operator given by Eq. (0 can be rewritten in a “scaled” 

form: 






'{id) 



1 - 



J = 0,1, 



(8) 



which guarantees that the distance (0 is input-state independent, i.e. the con- 
dition m is automatically fulfilled. The scaling factor in Eq.0 is sj =2/3 

(j = o,i). 

The Bures distance {j = 0, 1) between the output qubit and 

the ideal qubit is constant and it reads 



dB{piT^;A:f^) = V2 





(9) 



We note, that the idle qubit after the copying is performed is in a state 




where the superscript T denotes the transpose. 

We stress once again that the UQCM copies all input states with the same 
quality and therefore is suitable for copying when no a priori information about 
the state of the original qubit is available. This corresponds to a uniform prior 
probability distribution on the state space of a qubit (Poincare sphere). Cor- 
respondingly, one can measure the quality of copies by the fidelity T, which is 
equal to the mean overlap between a copy and the input state |Ej 

T= j dQaA'!'\pAAA'l')a,. (11) 

where f dfl = dip dd sin i?/47t. It is easy to show that the relation between 
the fidelity iF and the scaling factor is s = 2F — 1. 



3 Copying Network 

In what follows we show how, with simple quantum logic gates, we can copy 
quantum information encoded in the original qubit onto other qubits. The copy- 
ing procedure can be understood as a “spread” of information via a “controlled” 
entanglement between the original qubit and the copy qubits. This controlled 
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entanglement is implemented by a sequence of controlled-NOT operations oper- 
ating on the original qubit and the copy qubits which are initially prepared in a 
specific state. 

In designing a network for the UQCM we first note that since the state 
space of the copy machine itself is two dimensional, we can consider it to be an 
additional qubit. Our network, then, will take 3 input qubits (one for the input, 
one which becomes a copy, and one for the machine) and transform them into 
3 output qubits. In what follows we will denote the quantum copier qubit as b\ 
rather than x. The operation of this network is such, that in order to transfer 
information from the original oq qubit to the target qubit a\ we will need one 
idle qubit bi which plays the role of quantum copier. 

Before proceeding with the network itself let us specify the one and two-qubit 
gates from which it will be constructed. Firstly, we define a single-qubit rotation 
Rj{0) which acts on the basis vectors of qubits as 

Rj{9)\0)j = cos0|O)j -I- sin0|l)j; Rj{9)\l)j = — sin0|O)j -I- cos6>|l)j. (12) 

We also will utilize a two-qubit operator (a two-bit quantum gate), the so-called 
controlled-NOT gate, which has as its inputs a control qubit and a target qubit. 
The operator which i mplements this gate, Pki, acts on the basis vectors of the 
two qubits as follows {k denotes the control qubit and I the target): 

Pfcd0)fe|0)i = |0)fe|0)i; Pfci|0)fc|l)i = |0)fe|l)i; . . 

A.;|l)fc|0)i = |l)fe|l)d Ai|l)fc|l)i = |l)fc|0)i. ^ ^ 

We can decompose the quantum copier network into two parts. In the first 
part the copy (oi) and the idle (6i) qubits are prepared in a specific state 
Then in the second part, the information from the original qubit oq is 
redistributed among the three qubits. That is, the action of the quantum copier 
can be described as a sequence of two unitary transformations 



3.1 Preparation of Quantum Copier 

Let us first look at the preparation stage. Prior to any interaction with the input 
qubit we have to prepare the two quantum copier qubits (oi and bi) in a specific 
state 



1®')^'’’ = ^ (2|00)„i. + |01)„i. + 



1 



(15) 



which can be prepared by a simple quantum network with two controlled-NOTs 
Pki and three rotations R{0j), i.e. 

\^)aZf = RaAd3)Pb,aMO2)Pa,bM0imaMb^, 
with the rotation angles defined as [ 7 ] 



cos 201 = 



V5’ 



cos 202 = 






cos 203 = 



V5' 



(16) 

(17) 
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3.2 Quantum Copying 



Once the qubits of the quantum copier are properly prepared then the copying 
of the initial state of the original qubit can be performed by a sequence 

of four controlled-NOT operations 



\^) 



{out) 

CLQa\h\ 



— Ph\aQPa\aQPaQb\PaQa\ 



\^) 



{in) 

CLQ 



\^) 



(prep) 

ai6i 



(18) 



When this operation is combined with the preparation stage, we find that the 
basis states of the original qubit (oq) are copied as described by Eq. © with 
I t)x = |0)bi and I = |l)bi. 



4 Multiple Copying 



Here we present a generalization of the quantum network (II iSIl to the case when 
a set of N copy qubits Oj {j = 1, N) are produced out of the original qubit ag 

m- 

To find the 1 — > 1 + network we assume the following: 

(1) We assume that the information from the original qubit is copied to JV copy 

qubits ttj which are initially prepared in the state |fV;0)a = |0)ai (here 
the subscript a is a shorthand notation indicating that | 0)a is a vector in the 

Hilbert space of N qubits aj). 

(2) To implement multiple quantum copying we need to associate an idle qubit 
bj with each copy qubit, aj. These N idle qubits, which play the role of the 
copying machine itself, are initially prepared in the state |A^;0)b = |0)f,j ...|0)&„ . 

(3) Prior to the transfer of information from the original qubit, the copy and 
the idle qubits have to be prepared in a specific state 

N 

l^^iprep) ^ ^ + f,\N; k - 1)„] |iV; fc)b, (19) 

A;=0 



where 




fk = 



k 

N-k+1 






(20) 



and \N; k)a are normalized symmetric A(^-qubit state vectors with k qubits in 
the state |1) and {N — k) qubits in the state |0). The states (1 1 t)l) can be obtained 
by performing a sequence of local rotations R and controlled-NOT operations 
analogous to Eo. lTniH [rTlj . Once the copying machine is prepared in the state 
we can start to copy information from the original qubit oq. 

To describe the copying network we firstly introduce an operator Qa^a which 
is a product of the controlled-NOTs defined by Eq. mni) with oo being a control 
qubit and aj (j = 1, ..., N) being targets: 

(1 = P P P 

aodjV'*^ aociA'’ — 1 aoRi* 



( 21 ) 
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We also introduce the operator Qaao describing the controlled-NOT process with 
oo playing the role of the target qubit, i.e. 




Oiv-iao 



( 22 ) 



Now we find the 1 — > 1 + N copying network to be 




(23) 



where the (2A^+ 1) qubit output of the copying process is described by the state 



This last equation describes a simple quantum network when firstly the original 
qubit controls the target qubits of the quantum copier. Then the qubits a and 
b “control” the state of the original qubit via another sequence of controlled- 
NOTs. In this way one can produce out of a single original qubit a set of quantum 
clones. This quantum network realizes the unitary transformation for 1 1 + N 

cloning as introduced by Gisin and Massar 0. 

5 Properties of Copied Qubits 

Using the explicit expression for the output state we find that the original 

and the copy qubits at the output of the quantum copier are in the same state 
described by the density operator 



where the scaling factor depends on the number N of copies, i.e. 



which corresponds to the fidelity T = 2/3+ 1/3(A^+ 1). We see that this result 
for = 1 reduces to the case of the UQCM discussed in Section III. We also 
note that in the limit N —> oo, i.e. when an infinite number of copies is simul- 
taneously produced via the generalization of the UQCM, the copy qubits still 
carry information about the original qubit, because their density operators are 
given by the relation 




(24) 




(25) 




j = 0,1,..., oo. 



(27) 



which corresponds to the fidelity T = 2j?>. This is the optimal fidelity achievable 
when an optimal measurement is performed on a single qubit unEi. From this 
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point of view one can consider quantum copying as a transformation of quan- 
tum information into classical information 0. This also suggests that quantum 
copying can be utilized to obtain novel insights into the quantum theory of mea- 
surement [e.g., a simultaneous measurement of conjugated observables on two 
copies of the original qubit; or a specific realization of the generalized (POVM) 
measurement performed on the original qubit] . 

Comment 1 

We note that idle qubits bj after the copying is performed are always in the state 

4r’ = 5(4“’)^ + 5i. J = (28) 

irrespective of the number of copies created from the original qubit. 

Comment 2 

Using the Peres- Horodecki theorem mm we can conclude that an arbitrary pair 
Pa^ar, out of A^-|- 1 of Copied qubits at the output of the copier is inseparable only 
in the case = 1. In this case one of the eigenvalues of the partially transposed 
operator is negative, which is the necessary and sufficient condition for 

the inseparability of the matrix For > 1 all pairs of copied qubits at 

the output of the quantum copier are separable. 

6 Cloning of Quantum Registers 

In what follows we will propose a copy machine which universally copies higher 
dimensional systems. We shall be particularly interested in how the quality of 
the copies scales with the dimensionality, M, of the system being copied. What 
we find is that the fidelity of the copies decreases with M, as expected, but, 
somewhat surprisingly, does not go to zero as M goes to infinity. 

Let us consider a quantum system prepared in a pure state which is described 
by the vector 



M 

( 29 ) 

i=l 

in an M-dimensional Hilbert space spanned by M orthonormal basis vectors 
\'l^i)aa (* = I,...,A^). The complex amplitudes ai are normalized to unity, i.e. 
^ = 1. In particular, one can consider M = 2™ where m is the number of 

qubits in a given quantum register. One can generalize the no-cloning theorem 
which has been proven for spin-1/2 particles (qubits) by Wootters and Zurek [Q 
for arbitrary quantum systems. That is, there does not exist a unitary transfor- 
mation such that the state given in Eq. (12 PI) can be ideally cloned (copied), i.e. 
it is impossible to find a unitary transformation such that \^)ao — ^ |^^)aol^)ai 
for an arbitrary input states. 

Following our previous discussion we can ask whether a universal cloning 
transformation exists which will generate two imperfect copies from the original 
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state, |^)oo- The quality of the cloning should not depend on the particular 
state (in the given Hilbert space) which is going to be copied. This input-state 
independence (invariance) of the cloning can be formally expressed as 







( 30 ) 



where paf^ = |^^)aoao(^l is the density operator describing the original state 
which is going to be copied. This scaling form of Eq. Plj) guarantees that the 
Bures distance between the input and the output density operators is input- 
state independent. 

The quantum copying machine we shall use is itself an M dimensional quan- 
tum system, and we shall let \Xi)^ {i = 1, M) be an orthonormal basis of the 
copying machine Hilbert space. This copier is initially prepared in a particular 
state \X)x- The action of the cloning transformation can be specified by a uni- 
tary transformation acting on basis vectors of the tensor product space of the 
original quantum system the copier, and an additional M-dimensional 

system which is to become the copy (which is initially prepared in a state |0)ai). 
We have found the transformation of the basis vectors \^i)ao 



M 

+ \Xj)^; ( 31 ) 



(where * = 1, with the coefficients 

c = — ; d = — , (32) 

^/2(MTVj V2(M+1) 

such that the input-state independence of cloning is satisfied. That is, the clones 
have density operators given by En . (1,31 )ll with the scaling factor 

2 , (n/r o^J2 (M -I- 2) 

“ = = 2(mT1) ■ 

If M = 2, then the transformation reduces to the copying transformation for 
qubits given by Eq.(0). From earlier results of Gisin and Massar 0 the optimality 
of the transformation JSH) for M = 2 directly follows. At the moment we are 
not able to prove rigorously that the cloning transformation JSH) is optimal for 
arbitrary M > 2. Nevertheless, we have performed numerical tests which suggest 
that the cloning transformation is optimal. 

We note that the scaling factor (13.311 . which describes the quality of the copy, 
is a decreasing function of M . This is not surprising, because a quantum state 
in a large dimensional space contains more quantum information than one in a 
small dimensional one (e. g. a state in a 4 dimensional space contains information 
about 2 qubits while a state in a 2 dimensional one describes only a single qubit), 
so that as M increases one is trying to copy more and more quantum information. 
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On the other hand, it is interesting to note that in the limit M oo, i.e. in the 
case when the Hilbert space of the given quantum system is infinite dimensional 
(e.g. quantum-mechanical harmonic oscillator), the cloning can still be performed 
efficiently with the scaling factor equal to 1/2. 

In order to confirm that the quality of the copies which the copying trans- 
formation El produces is input-state independent (i.e., all states are cloned 
equally well) we evaluate the Bures distance (0. In our particular case we find, 
that the distance between and pa^'^ depends only on the dimension of the 

Hilbert space M, but not on the state which is cloned, i.e. 

= . ( 34 ) 

The Bures distance given by Eq. ll,44> is maximal when states in the infinite- 
dimensional Hilbert space are cloned, and in that case we find 

lim dB(pir),p(f) = V 2 -V 2 . (35) 

M— >CXD K V 

This means that even for an infinite-dimensional system, reasonable cloning can 
be performed, which is reflected in the fact that the corresponding scaling factor 
s is equal to 1/2. 

6.1 Local vs. Nonlocal Cloning 

Finally, we compare two methods of copying quantum registers. In particular, 
we shall consider cloning an entangled state of two qubits. We assume that the 
two qubits are prepared in the state 

\^)aobo = Q(|00)aubu -|- /3|ll)a(,f)o , (36) 

where, for simplicity, we have taken a and /3 to be real, and o? f? = 1. First, 
we shall consider the case in which each of the two qubits oq and bg is copied 
locally by two independent quantum copiers E). Each of these two copiers is 
described by the transformation (EJ with M = 2. Next, we shall consider a 
nonlocal cloning of the two-qubit state E3) when this system is cloned via the 
unitary transformation (El) with M = 4, i.e. the doner in this case acts non- 
locally on the two qubits. Our chief task will be to analyze how inseparability is 
cloned in these two scenarios, but we shall also examine the quality of the copies 
which are produced. From the Peres-Horodecki theorem it follows that the state 
m is inseparable for all values of c? such that 0 < < 1. 

Firstly, let us suppose that the two original qubits oq and bg are cloned inde- 
pendently (locally) by two independent local doners Xj and Xu, each described 
by the transformation ED with M = 2. The doner X: (Xjj) generates out of 
qubit ag (bg) two qubits ag and oi (bg and bi). After we perform trace over 
the two doners we obtain a four-qubit density operator which also de- 

scribes two nonlocal two-qubit systems, i.e. Pagbi and Paibg- These two two-qubit 
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systems are the clones of the original two-qubit register (TTHl . We note that these 
density matrices cannot be expressed in the scaled form (EH, and that the qual- 
ity of the copies depends on the input state. From the Peres-Horodecki theorem 
we immediately find that the density operators Paobi and Paibo are inseparable 
if 



1 

2 



2 

16 - “ 



1 

< - 

- 2 



IF' 



(37) 



This proves that for a restricted set of pure two-qubit states lldbll satisfying the 
condition lldYll . it is possible to perform a local cloning such that the original 
inseparability of entangled pair of qubits is (partially) preserved. 

Secondly, let us see what happens when we copy the entire two-qubit register 
at once. We would like to determine whether the set of original two-qubit states 
m, which after the cloning exhibit inseparability, is larger (i.e., the restriction 
of the form given in Eq. (E3) is weaker) than when a local cloning is performed. 
To do so, we introduce four basis vectors \^i) = |00); \'p 2 ) = |01); IlFa) = |10); and 
1 ^/ 4 ) = | 11 ), so that the original two-qubit state in Eq. (1^ is expressed as |^) = 
a|<Fi) -I- P\'1'4). The copying is now performed according to the transformation 
(EU with M = 4. We find that each of the two pairs of two-qubit copies at the 
output of the copier is described by the same density operator, i.e. Paobi = Paibo- 
Moreover, the fidelity of copying is input-state independent, and the quality of 
cloned registers is higher than that in the case of local cloning. Again, using the 
Peres-Horodecki theorem we find that the density operators Pao&i Paibo are 
inseparable if 



1 

2 



V2 



< < 



V2 

3 ■ 



(38) 



We conclude that quantum inseparability can be copied better (i.e. for much 
larger range of the parameter a) by using a nonlocal copier than when two local 
copiers are used. 



7 Conclusions 

We have presented the universal optimal quantum copying (cloning) machine 
which optimally clones a single original qubit to -I- 1 qubits. We have found 
a simple quantum network which realizes this quantum copier. In addition we 
have presented a universal doner for quantum registers. We have numerically 
tested the optimality of this doner, but a rigorous proof has yet to be found. 

Quantum copiers can be effectively utilized in various processes designed for 
the manipulation of quantum information. In particular, quantum copiers can 
be used for eavesdropping I2DI, they can be applied for realization of generalized 
(POVM) measurements [213 > they can be utilized for storage and retrieval of 

information in quantum computers m- 
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Abstract. The newfound importance of “entanglement as a resource” 
in quantum computation and quantum communication behooves us to 
quantify it in as many distinct ways as possible. Here we explore a new 
quantification of entanglement of a general (mixed) quantum state for a 
bipartite system, which we name entanglement of assistance. We show it 
to be the maximum of the average entanglement over all ensembles con- 
sistent with the density matrix describing the bipartite state. With the 
help of lower and upper bounds we calculate entanglement of assistance 
for a few cases and use these results to show the surprising property 
of superadditivity. We believe that this may throw some light on the 
question of additivity of entanglement of formation. 



1 Introduction 

Much of the preoccupation of classical information theory is in making the 
correlation between two ends of a communication channel — that is, between 
a sender and a receiver — as high as possible. This is what (classical) commu- 
nication is about. In contrast, this is only part of the story for the fledgling 
field of quantum information theory. The quantum world brings with it a new 
resource that senders and receivers can share: quantum entanglement, the stuff 
Einstein-Podolsky-Rosen pairs and Bell-inequality violations are made of. This 
new resource, of all the things in quantum information theory, is the most truly 
“quantum” of the lot. It is a resource because of the myriad ways in which it is 
starting to be used. The list of its applications already includes quantum key dis- 
tribution quantum-state teleportation |2|, entanglement-enhanced classical 
communication |2j, error-correction for quantum computers P], entanglement- 
assisted multi-party communication [S|, and better control of atomic frequency 
standards among other things. The list grows every day. 

The newfound importance of “entanglement as a resource” behooves us to 
quantify entanglement in as many distinct ways as possible. To this end, we 
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explore a new quantification of the entanglement of a general (mixed) quantum 
state for a bipartite system. This quantity we dub the entanglement of assistance. 
It is something of a dual notion to that of the entanglement of formation studied 
by Bennett, DiVincenzo, Smolin and Wootters jjj. It can be motivated in the 
following way. 

Consider three players Alice, Bob, and Charlie, who jointly possess many 
copies of a tripartite quantum system each described by the pure state 
It follows that Alice and Bob, considered in isolation, possess many copies of the 
(generally mixed) state Suppose now that Alice and Bob 

need to use their shared quantum system for one of the tasks above, teleportation 
for instance. However, very unfortunately, Charlie is not immediately available 
to pass his systems over for their use. Since their state may not be very pure or 
entangled for that matter, they may be seriously impaired by this. How might 
Charlie help in spite of his constraint? 

One thing he can do is perform a measurement on each system that he pos- 
sesses so that Alice and Bob will have pure states conditioned on his outcomes. 
If he transmits the classical information so obtained, they may be able to use 
it to their benefit. For instance, if several of his measurements reveal that Alice 
and Bob actually possess the pure state then by the result of Bennett, et 

al. [S| , they can convert these quantum states into maximally-entangled singlet 
states at a rate of S{p^) = — tr(p* log 2 p*), where p^ = trB|'F*®)('F*®|. In general 
then, for a measurement that creates the states I'F*®) with probabilities pi, i.e., 
an ensemble £ = {pi, the average rate for conversion to singlets 

will be 

- Pi) ■ ( 1 ) 

i i 

Since Charlie is a friend of Alice and Bob, he should choose his measure- 
ment so that this average rate is maximized. By a theorem of Hughston, Jozsa, 
and Wootters 0 this maximization can be taken over all possible ensembles S 
consistent with p^^ in the sense that 

i 

Therefore the resulting maximal rate is a number intrinsic to p^^ , depending 
upon no further details of the state This is the entanglement of assistance 

A{p^^) for p^^-. 

A{p^^) = maxif(f ) . (2) 

This quantity seems to be dual to the entanglement of formation, which is defined 

by 

F{p-^)=rn^E{£) . (3) 

In this paper, we demonstrate several properties possessed by entanglement 
of assistance, calculating it explicitly for some specialized cases. We exhibit a 
general upper bound, a particular upper bound specialized to the two-qubit 
case and a lower bound for diagonal density matrices, on the entanglement of 
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asistance. With the aid of the last two, we prove a fairly surprising property: the 
entanglement of assistance is in some cases superadditive. That is, there exist 
density operators p* ® ^r which 



This means that if Charlie performs entangled measurements on his separate 
copies of the system, he can be more effective in helping Alice and Bob than he 
would have been otherwise. 

2 Properties and Bounds 

The average entanglement of an ensemble is invariant under local unitary trans- 
formations and non-increasing under general local operationtQ with classical 
communication m, so entanglement of assistance also has these properties. 
In addition to this it is easy to see that since there is a maximization over 
all possible pure-state realizations of the density matrix, the entanglement of 
assistance A is concave. That is. 



We note here that the entanglement of assistance is the least of all concave 
functions coinciding on pure states with their partial entropy i.e. entropy as seen 
from one side, while entanglement of fomation is the greatest convex function 
with that property. This may be seen as a sign of the mentioned duality between 
the two. 

Since we do not have a general formula E| for entanglement of assistance of 
a given density matrix, upper and lower bounds are of great importance. We 
have found a few bounds which have helped us calculate the entanglement of 
assistance in some cases and have been of immense value in proving its super- 
additivity. 

2.1 Upper Bounds 

We have found an upper bound — the entropic bound — which works in general, 
and another upper bound — the fidelity bound — which works for the simplest 
case of one qubit on Alice’s side and one on Bob’s side. 

^ Local operations are those that Alice and Bob can perform on their own part of the 
system with possible synchronization using classical messages. However, they may 
not send each other quantum systems nor can they come together and perform a 
joint quantum operation on the system. 

^ We will present a formula for a non-trivial class of rank two density matrices in a 
forthcomming publication. 



A{pr pr) > mpd + mpD ■ 



( 4 ) 




( 5 ) 
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Entropic Bound: The entanglement of assistance is never greater than the 
minimum of the partial entropy as seen by Alice or Bob: 

A(p) < Min[S'(trA/9),5'(trBp)] . (6) 

To prove this assume that A{p) is achieved by the pure-state ensemble £ = 
{pi, Ui = I |}, then by the concavity of Von Neumann entropy S we have, 

-S'(trAp) > ’^PiS{tTAlIi) = A{p) . (7) 

i 

The same inequality holds with trace over A replaced by trace over B. Combining 
them we get the entropic bound. 

Since the Von Neumann entropy is additive and concave the bound is also ad- 
ditive and concave. Thus for states which saturate this bound the entanglement 
of assistance must be additive. We can also see from the results of section Q that 
the entropic bound is zero if and only if the entanglement of assistance is zero. 
The bound obviously agrees with entanglement of assistance for pure states. It is 
easy to show 0 that for the case of one qubit each on Alice’s and Bob’s side, the 
entropic bound is maximum (1 ebit) if and only if the entanglement of assistance 
is also maximum. 



Fidelity Bound: For the case of one qubit each with Alice and Bob, we can 
get a bound that is sometimes stronger than the entropic bound. It says that 
the entanglement of assistance cannot be greater than the fidelity of the density 
matrix relative to its complex conjugate in the magic basis 0 i.e., 

A{p) < F{p, p) (8) 



where the ~ — Hill-Wootters tilde — rep resents complex conjugation in the magic 
basis cni and F{p, a) = is the fidelity El that is, the square root 

of the transition probability El- To prove the bound the main thing we need is 
a classical inequality on the binary Shannon entropy [1 ,Sj . It is. 



H 2 {x) = — a:log 2 a: — (I — a:) log 2 (I — a;) 
< 2\J x{\ — x) . 



(9) 

( 10 ) 



Equality is achieved in this if and only if x = 0, 1. The other facts we need 

are that for a pure state 77^ in a decomposition of p, the largest eigenvalue, 
Ai(pf = treili), of its reduced density operator is given in terms of the Hill- 
Wootters tilde by 





1 - tr(77i77i) 



( 11 ) 



3 

4 



We show this in a forthcomming publication. 



The magic basis is: 
|e4) = ^2^;|e,) 



oo>-|ii> . 

-iV2 ’ 



A _ |01) + |10) . 

‘’3/ - _iv^ > 



I 64) 



| 01 )-| 10 ) 
v/2 
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and, that the fidelity function is doubly concave in its two arguments and 






( 12 ) 



Putting all this together: for any ensemble £ = consistent with p 

average entanglement is 



E{£) = J2p^S{pt) (13) 

i 

= ^fti?2(Ai(pf)) (14) 

i 

< 2^p*Y^Ai(pf)(l - Ai(pf)) (15) 

i 

= y^K\Ar(-n^ii7^) (16) 

i 

= Y,p,F{n,,fii) (17) 

i 

<F{p,p). (18) 



The bound in Eq. (0 follows immediately. 

Note the conditions for saturating this bound. In particular, in going from 
Eq. (ITU to 111 511 ■ we see that this new bound can be achieved only if p has a 
decomposition consisting only of maximally entangled and completely unentan- 
gled states. Thus the bound can’t be tight generally — consider simply a p that 
is pure but not itself maximally entangled. However, we will see in section (Bt 
that this bound is sometimes better than the entropic bound. 



2.2 Lower Bounds 

To get lower bounds we use the fact that the average entanglement of any pure- 
state realization of the density matrix gives us a lower bound on the entanglement 
of assistance. Using this idea we can come up with lower bounds by making sys- 
tematic pure-state decompositions of the density matrix. For example, average 
entanglement of the eigenvector decomposition of the density matrix is a lower 
bound. 

Applying the above idea to density matrices diagonal in a product basis 
we get a lower bound which we name diagonal lower hound. Consider the case 
of only one qubit on either side so that the product basis can be written as 
{|00), |01), |10), |11)}. The density matrix in this product basis looks like, 

/ai 0 0 0 \ 

0 «2 0 0 
0 0 03 0 
\ 0 0 0 04 / 



( 19 ) 
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and the bound is, 

OC 1 Q!9 

^(p) > (ai + 0!4)-ff2( ; ) + («2 + a3)i?2( ; ) • (20) 

This follows from the pure state realization oi p\ £ = {pi, \ipi) | j = 1, with 

{Pi} = {(«1 + «4)/2, (oi + 04)/2, (o2 + 03)/2, (o2 + 03)/2} 

\ipi) = (-\/oi/(oi + 04), 0, 0, + 04)) 

\i>2) = (-\/oi/(oi + O4),0,0,--\/o4/(oi + 04)) 

IV'a) = (0, ^02/(02 + 03), ^03/(02 + 03), 0) 

IV'4) = ( 0 , \/ 02/ (02 + 03), —\Ja-il (02 + 03), 0) . (21) 

Numerical results using optimization algorithms suggest that this is indeed the 
exact formula for entanglement of assistance for these density matrices. Also 
when this result saturates any of the upper bounds we know that it is the answer. 

The bound can easily be generalized to higher dimensionsU]. Another thing to 
note here is that off-diagonal terms between {| 00), 1 11)} and {| 01), 1 10)} don’t 
change the lower bound because we can still use a similar pure state decompo- 
sition as before, with slightly modified probabilities. 



3 Examples 

Let us start by characterizing the states that have zero entanglement of as- 
sistance. We find that there is a very simple characterization of these states 
as, states whose density matrix is pure on at least one side. That is, either 
p = |V’a)(V’a| (g) pb or p = PA® \iPb){'^b\- 

To prove this result first note that since the entropic (upper) bound (0 is 
zero for such states, these states have zero entanglement of assistance. Let us 
now prove that only such states can have zero entanglement of assistance. Since 
entanglement of assistance is zero, every valid pure-state decomposition of p must 
consist exclusively of product states. (For otherwise the average entanglement 
of that ensemble will be necessarily greater then zero.) So let us focus on an 
eigendecomposition of p: this is an ensemble £ = {A^, \ai)\Pi)}. (The Xi ^ 0 are 
the probabilities for the associated states.) 

Now, by the Hughston-Jozsa-Wootters theorem jSj, any “unitary reshuffling” 
of this ensemble is also a valid ensemble. Therefore let us consider a new ensemble 
£' = {pi, \ipi)} that is exactly the same as the old one, except in the first two 
elements. Namely, 

y/plltpi) = cosPv^lai)!/?!) + sin6'\/^|a2)|/32) (22) 

y/p 2 \ijj 2 ) = sin0-yAj'|ai)|/3i) - cos 9\/^\c(2)\P2) ■ (23) 

® This will be presented in a forthcomming publication. 
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Because the ensemble £ is an eigenensemble, we must have either {a\ \u 2 ) = 0 
or {(3i\(32) = 0 or both. First note that it cannot be the case that both inner 
products vanish. For otherwise, there will be at least one value of 0 for which 
l^i) and \ip 2 ) will be entangled states. Consequently, let us look at one of the 
other possibilities: namely, |ai) and \u 2 ) are othogonal, but (/3i|/32) ^ 0. Then 
we must have \(3\) = e*'^|/32)- For otherwise, again — for at least one value of 9 — 
both |'0i) and |'02) will be entangled states. The argument goes analogously if 
|/3i) and |/32) are othogonal. That is to say, \'ipi) and \'ip 2 ) must indeed be product 
states but with either identical left factors or identical right factors. 

Therefore, just to say it again (but now ignoring phases, since they could 
have been in the “unitary reshuffling” anyway), we must have either |ai) = \u 2 ) 
or |/3i) = 1/32). Locking one of these possiblities in, we can go through exactly the 
same argument for similar “unitary reshufflings” of other pairs of eigenstates. 
Consistency then gives that we must find either 

P=Ha){'4>a\® Pb or P = PA |V’b)('0b| • (24) 

This completes the proof. 

Let us now look at the states with the maximum value 0 for entanglement 
of assistance. This is the set of all possible mixtures of maximally entangled 
pure-states. For the simplest non-trivial case of one qubit on either side, this is 
equivalent to the set of ‘T’ states of Horodecki et al. nnni or equivalently the 
set of real Q density matrices in the magic basis ma. 

Next we consider some non-extremal examples. Consider states like p = 
Diagonal[o!, 0, 0, 1 — a] in a product basis. The entropic bound (0 says A{p) < 
H 2 {a) and the diagonal lower bound (EIll gives us A{p) > H 2 {a) so that 

A{p) = H2{a). 

Let us look at a more complicated example, say p = Diagonal[l/3, 1/3, 1/3,0] 
in a product basis. The diagonal bound gives us A (p) > 2/3. For the fidelity 
bound, we first calculate p =Diagonal[0,l/3,l/3,l/3] and by the fidelity bound 
(0 we see that A{p) < 2/3. Thus A{p) = 2/3. Notice that the entropic bound for 
this case is just 7F2(l/3) ~ 0.918 ebits. Since it does not saturate the entropic 
bound this state is a good candidate for superadditivity. Let us turn to that 
next. 

4 Superadditivity 

It has been speculated (and is indicated numerically for a few cases) that the 
entanglement of formation is additive m- This property is very important if 
it is to be used as a measure of entanglement. Since entanglement of assistance 
looks dual to entanglement of formation it comes as a bit of a surprise then that 
the entanglement of formation is superadditive. By this we mean 

A{pt^ ® P^'') > A{pt^) + 4(pAB) (25) 

® The maximum value is Log 2 (Min[NA, Nb]) ebits. Here, Na and Nb are the dimen- 
sions of the Hilbert spaces on Alice’s and Bob’s side respectively. 

^ upto a phase. 
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for some density matrices and P 2 ^. Since the entropic bound is additive, 
superadditivity is possible only for the states which do not saturate the bound. 

The density matrix p =Diagonal[l/3, 1/3, 1/3,0] which we have just seen in 
section 0 has an entanglement of assistance A = 2l2>. Recall that this state 
does not saturate the (additive) entropic bound. So it can show superadditivity 
and in fact it does! If A were additive: A{p® p) = 2A{p) = 4/3. However p® p 
can be realized as an ensemble £ which has an average entanglement E « 1.5506. 
(This ensemble is shown in detail in appendix^. This being a lower bound on 
A{p ® p) it shows that entanglement of assistance is superadditive. 

Superadditivity of entanglement of assistance means that if Alice, Bob, and 
Charlie have two copies of the same system then Charlie can give Alice and Bob 
more entanglement if he makes an entangled measurement on the two copies of 
the system. 

Since entanglement of assistance and entanglement of formation are dual to 
each other in the sense of replacing a maximization by a minimization, we would 
expect the superadditivity of the entanglement of assistance to tell us something 
about the additivity of the entanglement of formation. We have not been able 
to see a connection as yet but we note here a result m 

A{p) - F{p) < S{p) - I S'(trAp) - S'(trBp) | , (26) 

which connects entanglement of assistance and entanglement of formation. Thus 
in addition to its own physical significance, entanglement of assistance may turn 
out to be a useful tool to study the problem of additivity of the entanglement 
of formation. 

5 Conclusions 

In this paper we have introduced a new measure of entanglement — entangle- 
ment of assistance — and studied its properties and found upper and lower 
bounds for it. We have proved that it is superadditive meaning that, entangled 
measurements by Charlie give more entanglement to Alice and Bob. We also note 
here that the superadditivity of entanglement of assistance may throw some light 
upon the additivity question of the entanglement of formation. 
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A Superadditivity 

Here we look at the ensemble which gives us an average entanglement more 
than the additive value of 4/3 for the density matrix discussed is section 0). 
The ensemble written in the product basis {| OOaOOb), | OOaOIb), | HaUb)} 
is, £ = {pi, \ipi),i = 1, .., 12} where. 



{Pi} — Q^ij Qfi, ai, oi, ai, 02, 02, 02, 02, a2, 02 ) 

\tpi) = (0, 0, 0 , a, 0, 0, 0, 0, 0, 0 , a, 0, 0, 0) 

11 / 2 ) = (0, 0, 0 , a, 0, 0, 0, 0, 0, 0, a, 0, 0 , 0) 

I1/3) = (0, 0, 0, a, 0, 0, 0, 0, 0, 0, -a, 0, 0, 0) 

\ip 4 ) = ( 0 , 0 , 0 , a, 0 , 0 , 0 , 0 , 0 , 0 , -a, 0 , 0 , 0 ) 

I 1 / 5 ) = {d, 0, 0, 0, 0, 0, b, 0, 0, b, 0, 0, 0, 0, 0, 0) 

\^e) = {d, 0, 0, 0, 0, 0, -b, 0, 0, -b, 0, 0, 0, 0, 0, 0) 

I 1 / 7 ) = (0, C, 0, 0, 0, 0, b, 0, c, 0, 0, 0, 0, 0, 0, 0) 

\tps) = (0, c, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 

Itjjg) = (0, c, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 

1 ^ 10 ) = (0, 0, c, 0, c, 0, 0, 0, 0, b, 0, 0, 0, 0, 0, 0) 

IV’ii) = (0, 0, c, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 

1 ^ 32 ) = (0, 0, c, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 



and. 



a = 6 — 



5 + V7 

18 



24 







oi = ■ 



d = 




2 



02 = 



3(5 + V7) 



0.0872. 



(27) 
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The entanglements for the pure-states forming the ensemble £ are, 



{Ei} 



{Ect , Ea 5 E(^ , Ef^ , Ep , Ep , E^ , E^ , E^ , E^ , E^ , E ^ ) 



( 28 ) 



where, 



Ed = -^ 2 ( 0 ^, b^) ~ 1.8824 ebits , 

« 1.1834 ebits , 

^^7 = H 2 {c^, 6^, c^, 0) « 1.4605 ebits , 

and average entanglement of this ensemble is, 

E{£) = 4aiEa + 2a\Ep + Qa 2 E^ « 1.5506 ebits. (29) 

To find this ensemble we use the following procedure: First, we use a nu- 
merical optimization program to find the ensemble which has the maximum 
average entanglement. Then using the structure of this ensemble, guessing the 
right phases and using the constraint that the ensemble must give us p (£> p we 
get a quadratic equation which we solve to find the values for a, b, c, d, a\ and 
CX2- 

Note that according to our numerical results, the ensemble presented here 
achieves the maximum average entanglement consistent with the density matrix. 
This suggests that this is the entanglement of assistance for it. 
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Abstract. An investigation of Einstein’s “physical” reality and the con- 
cept of quantum reality in terms of information theory suggests a solution 
to quantum paradoxes such as the Einstein-Podolsky-Rosen (EPR) and 
the Schrodinger-cat paradoxes. Quantum reality, the picture based on 
unitarily evolving wavefunctions, is complete, but appears incomplete 
from the observer’s point of view for fundamental reasons arising from 
the quantum information theory of measurement. Physical reality, the 
picture based on classically accessible observables is, in the worst case 
of EPR experiments, unrelated to the quantum reality it purports to re- 
flect. Thus, quantum information theory implies that only correlations, 
not the correlata, are physically accessible: the mantra of the Ithaca 
interpretation of quantum mechanics. 



1 Introduction 



The concept of “physical reality” as championed by Einstein Q — the postulate 
that the objective state of a system is specified by a set of real-valued parameters 
independently of our knowledge of them — has been an object of contention ever 
since the inception of quantum theory (see, e.g., j‘2l,'II4l,5lfil7j 1 . The most prevail- 
ing views assert either that the “quantum reality” suggested by wavefunctions 
and non-local correlations is only a mathematical construction necessary for a 
consistent theory (Bohr’s view) , or else that physical reality is deterministic but 
incompletely described by quantum mechanics (Einstein’s view). A popular in- 
terpretation of the latter view is that physical reality is obscured by inaccessible 
hidden variables 0, a stance that appears to be discredited by the violation of 
Bell’s inequalities in quantum mechanics |0|. Bohr’s view of complementarity, on 
the other hand, assigns a special status to classical physics as an essential ingre- 
dient in measurement since it requires the measurement device to be classical. As 
recognized by von Neumann mg, this undermines the foundations of quantum 
mechanics as a complete and consistent theory. Here, we suggest that Einstein 
realism and Bohr’s complementarity principle can be reconciled within a frame- 
work that consistently describes the concept of information in quantum mechan- 
ics. This is exemplified by the quantum information theoretic treatment of the 
Einstein-Podolsky-Rosen (EPR) experiment [H and the Schrodinger-cat para- 
dox which has recently attracted increasing attention (see, e.g., |I3)- We 
propose that, in general, the perceived physical reality and quantum reality can 
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be disjoint, that is, the result of a quantum measurement conceivably might not 
carry any information — in the sense of Shannon theory na — which would allow 
the observer to infer the state of the measured system. While counterintuitive, we 
shall show that this picture is a direct consequence of an information-theoretic 
reinterpretation of quantum measurement. Moreover, such a view effortlessly re- 
solves the EPR paradox which has inspired the discussions on reality, as well as 
other quantum paradoxes rooted in the measurement problem. 

The gedankenexperiment that constitutes the EPR paradox was created by 
Einstein, Podolsky, and Rosen to demonstrate their dissatisfaction with “un- 
knowables” Q. In that experiment, it appears that two complementary vari- 
ables (such as position and momentum) are in principle measurable by exploit- 
ing the quantum correlations between the two particles, in contradiction with 
Heisenberg’s uncertainty principle. Their conclusion, namely that the quantum 
mechanical description of reality must therefore be incomplete, was based on a 
criterion for reality which they considered “reasonable” (see below). This crite- 
rion was faulted by Bohr m in his reply to the EPR paper, insisting rather that 
physical variables are never independent of the way they are measured owing to 
the complementarity principle, and therefore that measurements do not confer 
reality to properties of quantum objects. We shall show here, using quantum 
information theory only, that, while indeed an element of reality is not created 
for the measured quantum system, the result of a quantum measurement cre- 
ates an element of reality for the result of another measurement, i.e., it allows 
you to predict the state of another measurement device without revealing the 
state of the quantum system itself. Thus, physical reality is reflected in cor- 
relations between classical objects only. This view, which we arrived at from 
a quantum information-theoretic examination of quantum measurement fT^ 
essentially coincides with Mermin’s “Ithaca Interpretation of Quantum Mechan- 
ics”, Ref. PT) . 

2 The EPR Paradox 

The EPR experiment in the version of Bohm m involves the preparation of a 
quantum system such as the one created by the decay of a spinless particle into 
two half-integral-spin particles: 

|^epr) = ^(IU)-UT)) . (1) 

This state represents the superposition of the two possible situations: “left- 
particle spin-up, right-particle spin-down”, and “right-particle spin-up, left-par- 
ticle spin-down” . Let us now imagine that the pair so-created is separated suf- 
ficiently far that classical information would take a long time to travel between 
them. Then, we measure for example the z-component of the spin of one of 
the particles (say, the left one). This measurement has two possible outcomes, 
which occur with probability one-half each, implying that the von Neumann un- 
certainty of the density matrix describing any one of the particles (denoted by 
subscripts L and R), 
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PL,fl = i|T)(Ti + i|i)ai (2) 

is one bit 

S{pl,r) = -TrR^L (pL,fllog2 = 1 (3) 

in spite of the fact that entropy of the combined system vanishes. The latter 
is of course well-known: for a quantum mechanical “pure state” (p|pj^ = Pepr, 
where pepr = |!^epr)(!^epr|) the von-Neumann entropy vanishes 5 '(pepr) = 0, 
i.e., the state is perfectly well-known. 

Clearly then, the quantum nature of the EPR state is very peculiar since 
the uncertainty of a part of this system can be larger than the uncertainty of 
the pair. Classically, this is impossible. Indeed, if we describe uncertainties using 
(classical) Shannon entropies, the Shannon entropy of a system A, say, with 
A c AB, is 

H{A) < H{AB) . (4) 

This property of monotonicity of entropies is violated in quantum mechan- 
ics m This violation, on the other hand, can be described consistently in 
an information-theoretic formalism which allows for negative conditional en- 
tropies In other words, there exists an information theory, extended to 

the quantum regime, in which the violation of classical laws such as monotonicity 
are inevitable consequences. 

Quantum enianpZement situations, such as encountered in EPR pairs, are pro- 
totype systems to examine the classically forbidden regime of negative entropies. 
In the case at hand, the joint, conditional, mutual, and marginal entropies of the 
EPR pair can be summarized by the entropy diagram in Fig. 1. Such diagrams 
are used extensively in classical information theory and serve as mental scratch 
pads to remind us of the separation of unconditional entropies into conditional 
and mutual pieces. While in the past the violation of monotonicity prevented 
the use of Venn diagrams in quantum information theory, the introduction of 
negative entropies has reinstated this useful tool |21i2ill5IRij . In particular, we 
can see how 

S{L) ^ S{LR) . (5) 

is possible in Fig. 1 if S{L\R) is negative. 

The repercussions of such an information-theoretic description of entangle- 
ment for the extraction of information from such a state (a measurement) are 
manifold. Here, we focus on EPR experiments and other quantum paradoxes, 
and on implications for physical as well as quantum pictures of reality. 

3 Information Theory of EPR Experiments 

In order to assess the relation between quantum and physical reality in an EPR 
measurement, we need to describe both the quantum system (the EPR wave- 
function) and the classical devices it becomes entangled with, using information 
theory. 
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Fig. 1. Quantum entropy diagrams, (a) Definition of joint [^(Li?)] (the total 
area), marginal [5'(L) or S{R)], conditional [5(L|i?) or ^(i^lL)] and mutual 
[S{L : i?)] entropies for a quantum system LR separated into two subsystems L 
and R; (b) their respective values for the EPR pair. 




Fig. 2. Measurement of EPR pair Q 1 Q 2 by devices Ai and A 2 . 

Qi Qj 

i i 

^ EPR ^ 




Let Ai and A 2 denote measurement devices, each of the devices measuring 
the z component of one member of an EPR pair, for example (see Fig.|2I). 

It is an experimental fact that the measurement of the state of one of the 
particles (say, CTz) allows a 100% accurate prediction of what the outcome of the 
measurement of the other one will be. Thus, the outcomes of the measurement 
of (Tz are perfectly correlated, a situation described by the entropy diagram in 
Fig-EKi which appears perfectly classical (no negative numbers appear). 

Note that the correlations between the devices are quite unlike those of the 
quantum system that is measured, a peculiarity that is quantitatively manifested 
when comparing Figs, lb and Et. The reason why the correlations between the 
measurement devices (Fig. Et.) incompletely mirror the entanglement present 
in the quantum state (Fig. lb) must be due to the device’s classical nature: 
classical conditional entropies cannot be negative. However, classicality must 
not be assumed for the devices, it is a mathematical result of the information- 
theoretic treatment of measurement (which gives rise to Fig. Eh) [T7>| . 

Assume now that orthogonal spin projections are measured on the two halves 
of the EPR pair, say cTz on the left particle, and cr^ on the right one. If we as- 
sume that measuring the state of one partner confers reality to the state of the 
measured system, we must conclude that the experiment just described would 
allow us to infer the z and x projections simultaneously, a state of affairs strictly 
forbidden by the uncertainty relation. In their landmark paper EPR there- 
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fore conclude that, since conventional quantum mechanics cannot describe this 
peculiar situation, the theory must necessarily be incomplete. This is the essence 
of the EPR paradox. It relies on a definition of reality based on “certain predic- 
tion”0 according to which the state of the second particle would acquire physical 
reality after measuring its EPR partner. In fact, for this particular experiment 
(measuring ctj on the left and ax on the right particle) it is found that the 
outcomes recorded by the devices are completely uncorrelated as depicted by 
the classical entropy diagram for the devices pictured in Fig. 03. Rather than 
reflecting an incompleteness of the formalism, these outcomes are predicted by 
quantum information theory, and imply that physical reality is attributed to the 
state of the second measurement device, or more precisely the relative state of 
the devices, while there cannot be any correlation between the apparatus and 
the quantum state proper (as we show below). In view of the importance of this 
conclusion, let us repeat it once more. Quantum information theory predicts that 
in EPR-type measurements, the measurement device cannot reflect the state of 
of the quantum system. In the language of Mermin ini. the correlations between 
the devices are real, i.e., possess physical reality, while the quantum system itself 
does not. 



Fig. 3. Entropy diagram for the devices: (a) recording az for each of the entan- 
gled particles, or (b) recording for one and ax for the other particle. 

“ A. A, 





Let us show this in more detail. For a proper quantum information-theoretic 
analysis, we need to consider four systems: a quantum pair Q 1 Q 2 and a pair 
of ancillae A 1 A 2 . The ancillae can be thought of as classical devices that are 
built to reflect the quantum states. From a measurement point of view, we are 
interested in the correlations between the ancillae, as only such correlations are 
experimentally accessible (relative states). Before we analyze them using quan- 
tum entropy diagrams, let us ponder what we expect to And from an orthodox 
point of view. 

One of the fundamental tenets of classical measurement theory is that a 
measurement device is constructed such as to mirror the state of the object 
to be measured as accurately as possible. In other words, measurement entails 

^ EPR wrote in “If, without in any way disturbing a system, we can predict with 
certainty (i.e., with probability equal to unity) the value of a physical quantity, then 
there exists an element of physical reality corresponding to this physical quantity.” 
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the transfer of this information to a macroscopic system that is more suited to 
accurate observation, without altering the state of the system. While it is well- 
known that quantum measurements cannot be made without altering the quan- 
tum state the general belief is that the quantum state after measurement 
is truthfully portrayed by the device. In other words, it is believed that correla- 
tions between the quantum state and the ancillae in the measurement situation 
allow the extraction of information about the quantum system. Let us consider 
the “orthodox” (classical) entropy diagram (Fig. for an EPR measurement, 
drawing the quantum system Q1Q2 as one system, measured by the ancillae Ai 
and A2- These diagrams reveal the paradox inherent in this description. On the 



Fig. 4. Classical entropy diagram for the EPR measurement of spin-projections: 
(a) both devices measure (b) one device measures the other ax- Note that 
the entropy of Ai and A2 have to be one bit in each case, as the measurement 
outcomes are equiprobable, while Q1Q2 is thought to have two independent 
equiprobable degrees of freedom. 




one hand, when the same projection of the spin (e.g., a^) is measured for both 
particles (Fig. classical reasoning suggests that the quantum system and the 
measurement devices share information (one bit in the center of the diagram). 
On the other hand, when orthogonal polarizations are measured (Fig. 0 d) the 
measurement devices must appear uncorrelated. According to a “physical real- 
ism” or “hidden variable” picture, both diagrams in Fig. 0 must have a common 
underlying classical diagram relating five ensembles: the EPR pair Q1Q2 and the 
four possible measurements Ai[(Ta;], A2[az], and A2[ax]3 This underlying 

diagram, however, is in contradiction with the Heisenberg uncertainty principle, 
as it implies that the counterfactual variables ax and a^ (of the same particle) 
can be measured together. Thus, this classical treatment of information leads to 
a paradox. 



^ The diagrams in Fig. 0 are obtained from such an underlying diagram by ignoring 
two out of the five variables: Fig. ^ by ignoring Ai[(Ta;] and A 2 [ax], Fig. ^ by 
ignoring Ailuj,] and A 2 [az], “Ignoring” a system is achieved by the mathematical 
operation of tracing it out of the joint density matrix. 
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Fig. 5. (a) Quantum entropy diagram for the EPR measurement of same spin- 
projections: e.g., Ai and A 2 both measure (b) reduced diagram obtained by 
tracing over the quantum states Qi and Q 2 - 



(a) QA 




(b) Q,Q, 





The paradox is resolved by drawing the quantum entropy diagrams for the 
measurements (Figs. El and ED- The values for the respective quantum entropies 
entering these diagrams can be obtained by straightforward calculation |I3 . In 
Fig. 0 the entropy diagram representing the situation where the same polariza- 
tions are measured is that of a GHZ state 123: fully symmetric and maximum 
quantum entanglement between three entities. As is well-known, tracing over 
(ignoring) one member of such a triplet produces classical correlations (of the 
type depicted in Fig. in the remaining doublet, as indicated in Fig. 03. As 
a consequence, the quantum entropy diagram of Fig. correctly reproduces 
the observed correlations between the detectors Ai and A 2 . Closer inspection of 
Fig-01, however, reveals that while the measurement devices are perfectly corre- 
lated as they should, their mutual entropy (the single bit of information gained 
in the measurement) is not shared by the quantum system Q\Q 2 - In Figs. 0and 
0 this ternary mutual informatioifl is represented by the center of the diagram, 
and measures how much of the correlations between the measurement devices 
is shared by the quantum system. If the center of the diagram is zero, we must 
conclude that no information is shared between quantum system and classical 
devices. 

The same is true for the measurement situation in Fig.Et, where incompatible 
polarizations are measured. Again, the (four part) system is fully entangled, and 
ignoring the quantum state produces the experimentally observed independence 
of the measurement results (Fig. Eb, compare Fig. 03). Yet, the correlation be- 
tween quantum state and measurement device (the mutual information between 



^ Just like any entropy, information, which is the mutual entropy between two sys- 
tems, can be split up into a conditional and a mutual piece with respect to a third 
system H3- 
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the measuring and the measured system) is unchanged from the previous ar- 
rangement, in fact, it vanishes in both casefl 



Fig. 6. (a) Quantum entropy diagram for the EPR measurement of orthogonal 
spin-projections, e.g., A\ measures Ux while A2 records Uz- (b) Reduced diagram 
as above. 



(a) 



QA 




(b) 



QA 

-2 



AjiOxl 




AziOxl 



This situation leads us to suggest that we must abandon at least one cher- 
ished notion of orthodox measurement theory: that the apparatus necessarily 
reflects the state of the system it was built to measure, by being correlated with 
it in the sense of Shannon. Rather, it is the correlations between the ancillae (the 
reality of their relative state) that create the illusion of measurement. Indeed, any 
subsequent measurement on each side (left or right), for example, would yield 
the same result, over and over again, while still not implying anything about 
the quantum wavefunction. Each observer that repeats a measurement becomes 
classically correlated to the earlier outcome, whatever the outcome. Still, the 
quantum reality of the superposition is unperturbed by these measurements: 
none of the repeatable measurements yield any information about the quantum 
state, while they are internally completely consistent. Note that the orthodox in- 
terpretation of these correlations involves the concept of a wavefunction collapse: 
the measurement of the first particle projects — or collapses — the wavefunction 
of the other one, to account for the perfect correlation. Since the devices do 
not reflect the state of the quantum system, however, no collapse is needed to 
explain the correlations, nor does it actually occur, as we now show. 



4 Information Theory of Schrodinger Cats 

The Schrodinger-cat paradox is of prime importance for the understanding of 
quantum decoherence and the quantum-classical boundary. The latter have re- 
ceived increased attention recently due to their importance for the design of 
quantum computation and communication devices m- 

The mutual information between qnantnm system and both classical devices also 
vanishes for any intermediate situation between Figs.ElandEl since the joint system 
Q1Q2A1A2 is always a pure state m- 
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The Schrodinger-cat paradox explores the relationship between classical and 
quantum variables by coupling them together in such a way that the decay of 
a radioactive substance (say, one isolated atom) implies the demise of a cat 
locked up with the deadly contraption in a sealed room. The quantum reality of 
the (isolated) atomic system is that of a superposition of a decayed atom with 
gamma ray, and an undecayed atom without. If brought into contact with the 
cat, however, quantum mechanics forces us to include the cat in this entangled 
wavefunction 

\^) = ^{\A\0,L) + \A,1,D)) , (6) 

where |A*) and |0) refer to the excited atom and absent gamma, while |A) and 
|1) are the wavefunctions of the decayed atom and the gamma. Furthermore, 
\L) and \D) refer to the “live” and “dead” cat eigenstates. The paradox arises if 
an observer peeks into the room to observe the state of the cat. Does the cat’s 
wavefunction immediately collapse into one of its eigenstates (dead or alive) upon 
observation? The preceding analysis teaches us that this is not necessary. The 
observer can be thought of as a fourth system that is now quantum entangled 
with the previous troika: atom, gamma, and cat 

\^) = ^{\A\Q,L,l) + \A,l,D,d)) , (7) 

introducing “observer eigenstates” |l) and \d). Then, upon tracing over the quan- 
tum degrees of freedom of the atom (after all, this experiment involves moni- 
toring the cat and not the atom), the cat (serving as the hapless gamma-ray 
detector) appears perfectly correlated with the observer peeking into the room. 
Cat and observer agree, so to speak, about the observation, and their state is 
entirely classical. Yet, their agreement is completely decorrelated, disjoint, from 
the quantum state, as their mutual information shared with the atomic sys- 
tem vanishes. In other words, the classical reality displayed by cat and observer 
does not imply anything about the quantum reality of atom and gamma ray, or 
vice versa. Fundamentally, the reason why the observer does not register a cat 
mired in a quantum superposition of the living and non-living states is because 
the observer, having interacted with the cat, is entangled with, and thus part 
of, the same wavefunction. As the wavefunction is indivisible, an observer (or 
measurement device) would have to monitor itself in order to learn about the 
wavefunction. This is logically impossible. 

5 Conclusions 

To summarize, we assert that quantum reality is “real” in the sense that quantum 
mechanics completely and deterministically describes the evolution of a closed 
system (not just its wavefunction), and that the statistical character arises from 
the fact that an observer, because he is part of the closed system, is offered an 
incomplete view of the quantum system he attempts to measure. Consequently, 
the quantum universe is deterministic as Einstein’s physical reality demands. 
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but must include the observer as one of its parts due to the inseparability of 
entangled quantum states. The recent information-theoretic analysis of quantum 
measurement shows that such an observer indeed perceives the system 

he is measuring as probabilistic, and thus that Bohr’s complementarity principle 
emphasizing the importance of the system/observer relation therefore holds at 
the same time. If quantum reality is so elusive, how then can we learn about 
its nature? Fortunately, while negative entropy cannot be reflected in classical 
instruments directly, it is possible to infer it from combined measurements and 
comparison with classical bounds (a case in point are Bell inequalities [^, see 
also m- Thus, quantum reality does leave its traces in experiments, while the 
direct observation of superpositions is impossible. 
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Abstract. The concepts of conditional entropy of a physical system 
given the state of another system and of information in a physical sys- 
tem about another one are generalized for quantum one is that the en- 
tropy and information in quantum systems. The fundamental difference 
between the classical case and the quantum one is that the entropy and 
information in quantum systems depend on the choice of measurements 
performed over the systems. It is shown that some equalities of the clas- 
sical information theory turn into inequalities for the generalized quan- 
tities. Specihc quantum phenomena such as EPR pairs and ’’superdense 
coding” are described and explained in terms of the generalized condi- 
tional entropy and information. 



Keywords: Quantum Information, Quantum Conditional Entropy, Informa- 
tion in Entangled States, Quantum Measurement 

The purpose of this paper is to show how the concepts of conditional entropy 
and information (by C.E. Shannon) can be generalized for quantum systems. It 
is shown that the freedom of choice of measurement performed over quantum 
systems results in some classical information-theoretical equalities becoming in- 
equalities. The generalized concepts are applied to describe typical quantum- 
mechanical phenomena of EPR pairs and the so called ” superdense coding” . 

Consider a quantum system that consist of two subsystems, A and B. Let 
a and (3 be complete sets of variables in A and B, respectively. Let {a} and 
{6} be sets of values taken on by a and /3, respectively. The Hilbert space of 
the states of the system 7i is the tensor product of the Hilbert spaces of the 
subsystems: = Ba ®'Hb- The state of the system is , in general, a mixed one 

and is described by a joint density matrix in 

p{A,B) = \\pab,a'b'\\ ( 1 ) 

where the matrix elements are written in the basis formed by tensor products 
of eigenvectors of a and /3.The marginal density matrices of the subsystems are 
given by partial traces: 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 2fi9- 07!l 1999. 
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p{A) = TrBp(A B) = II Pab,a'b\\ 

p{B) = lrAp{A,B) = \\Y.^Pab,ab'\\ ( 2 ) 

The joint entropy of the system and the marginal entropies of the subsystems 
are, respectively, 



H{A,B) = -Trp{A,B)logp{A,B) 

H{A) = -Trp{A)logp{A) 

H{B) = -Trp{B)logp{B) (3) 

Denote by H{a,P), H{a), and H{(i) the entropies of the results of the mea- 
surements performed over the states p{A, B), p{A) and p{B), respectively , in the 
bases formed by eigenvectors of a and f3. Then, according to Klein’s Lemma[l,2], 

H{A,B) < H{a,P); H{A) < H{a); H{B) < H{f3) (4) 

where the inequalities hold iff the corresponding density matrix is diagonal 
in the measurement basis. Inequalities (4) express, by von Neuman[2], the effect 
of the irreversibility of quantum measurements. 

The next step is to introduce conditional density matrix [3,4]. Denote by 
Pb the projection operator on the eigenvector corresponding to eigenvalue b of 
the complete set of variables [3. Then the conditional density matrix of the 
system A, given that the result of measurement of l3 performed over system B 
is 6, is 



p{A\(3 = b) 



PbpjA, B)Pb 
JrAp{A, B)Pb 



Pab,a'b 



X[o Po-b, 



ab 



( 5 ) 



Note that the denominator in (5) is just the probability of (3 to take on value 



lrAp{A, B)Pb = Pr{/3 = h} (6) 

Also, for any b, 'TrAp{A\(3 = 6) = 1 ( as it should be for a density matrix), and 

^(Tr^p(A, B)Pb)p{A, 1/3 = 6) = p{A) (7) 

b 

Then, in accordance with Shannon’s definition of conditional entropy, and 
the quantum definition of entropy [2], the conditional entropy of system A given 
the measurement /3 performed over B, is 

77 (A|/3) = - ^(Tr^p(A, 73)P,)Tr^p(A|/3 = 6)log p{A\(3 = b) (8) 

b 

By Klein’s Lemma, for any a and /3, 



0 < 77(A|/3) < H{a\(3) 



(9) 
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where the equality holds iff all p{A\(3 = b) commute and are diagonal in the basis 
of eigenvectors of a. 

The quantity given by (8) depends on the choice of measurement l3 (or, more 
exactly, on the choice of basis in Hb formed by the eigenvectors of the complete 
set of variables /3) . This situation is different from that in classical theory, where 
any complete set of variables would give the same conditional entropy for the 
system A. Since the meaning of conditional entropy is to express the uncertainty 
of the subsystem A under the constraints imposed by any possible measurement 
performed over the subsystems B, we suggest the following definition. 

Def. 1. Conditional entropy of a subsystem A, conditioned by a subsystem 
B, is 



H{A\B) = inf H{A\P) (10) 

Theorem 1. 

H{A\B) < H{A) 

Proof. 

H{a\(3) < H{a) 
ird H{a\(3) < H{a) 



H{A\B) = infy^infiJ(a|/3 = b) < infinfi?(a|/3) < iniH{a) = H{A) □ 

b 

Note that according to classical information theory iJ(o;|/3) = H{P)+H{a\P). 
However, this equality turns into inequality for quantum systems. 

Theorem 2. 



H{A,B) < H{B) + H{A\B) (11) 

The inequality (11) stems from the fact that H{A, B) = inf^H{'-f), where 7 is a 
basis in H, which is not necessarily a tensor product of bases in Ha and Hb- 
Turning to information, we should remember that information ( in Shannon’s 
sense) is defined iff there is a pair of random variables. In the case of quantum 
physics, these random variables are observables, and their values are outcomes of 
measurements. Thus, if a and /3 are complete sets of observables for systems A 
and B, respectively, we may consider mutual information between the outcomes 
of measurements of a and l3 : 

I{a;P) = H{a) - H{a\P) (12) 

Def. 2. Information in the subsystem B about the subsystem A ( and vice 
versa) is 

I{A;B) = sup I{a;/3) 

a,l3 

Then the classical equality (12) turns into inequality. 



(13) 
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Theorem 3. 



I{A- B) < H{A) - H{A\B) (14) 

The equality in (14) holds iff for [3 such that H{A\B) = H{A\P) all p{A\(3 = b) 
commute and is achieved for the basis of eigenvectors of a such that all p{A\(3 = 
b) are diagonal there. 

Proof. It follows from the entropy defect principle [5,6] that for any (3 

I {A; B) = sup J(a; (3) < H{A) - H{A\!3) (15) 

a 

where the equality holds iff there exists a basis of eigenvectors of a where all 
p{A\(3 = h) are diagonal. 

From (15), 



I{A-, B) = sup/(a; /3) < sup[iJ(T) — H{A\P)] = H{A) — inf H{A\P) = 

a,0 0 0 

H{A)-H{A\B) □ 



Example 1: EPR pair 

Consider two particles, A and B, each having a 2-dimensional Hilbert space. 
Let {jo), |1)} be the basis in Ha and the basis in Hb formed by eigenvectors of 
a and eigenvectors of (3, respectively. Let the system be in pure entangled state 
with a density matrix 



p(H,H) = i(|01)-|10))((01|-(10|) 

In other words, the density matrix in the (a, P) basis is 



p{A,B) 



0 0 0 0 

0 1-10 
0-110 
0 0 0 0 



The marginal density matrices are : 



P{A) 



'1 o' 


. X 1 


'1 o' 


0 1 


; P{B) = - 


0 1 



The conditional density matrices of the subsystem A are: 



(16) 



(17) 



p{A\p = 0) 



; p{A\p=l) 



1 0 
0 0 



(18) 



Hence H{A, B) = 0 and H{a, /3) = 1; H{A) = H{a) = 1; H{B) = H{P) = 1; 
H{A\B) = H{A\P)=Pi. 
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Note that all p{A\(3 = h) commute and are diagonal in basis (3. Therefore, 



I {A- B) = sup I{a; /3) = H{A) - H{A\B) = 1 

a,f3 



(19) 



Let us now take a different basis in B, formed by eigenvectors of another 
observable $: 



{^(| 0 ) + | 1 )),^(| 0 )-| 1 ))} 



( 20 ) 



In (a, $) basis 



p{A,B) = 



1 



Then 



p{A0 = 0) = i 



1 1 
1 1 



1 1-11 

1 1-11 

-1 -1 1 -1 
1 1-11 



p{A0=l) = ^ 



1 -1 

-1 1 



( 21 ) 



(22) 



Hence, H{a\l3) = l and I(a;/3) = 0 (23) 

But the conditional density matrices (22) still commute, and H{A\(3) = 
H{A\B) = 0. Therefore there should exist an observable d in H such that 

I{A-,B) = I{a-J) = l (24) 

Indeed, this is true for an osbervable a with eigenvectors 

{^(|0) + |1)),^(|0)-|1))} (25) 

Example 2. Superdense Coding 

The phenomenon of superdense coding)?] is essentially quantum-mechanical 
and has no classical counterpart. It can be described as follows. 

Suppose we have an EPR pair of particles B and B' in an entangled state (16) 
and a third particle of the same sort , A, whose intial state is independent of the 
pair (B, B'). Then, by local unitary transformations over the particles A and B' 
one can obtain four different entangled states of particles A and B depending on 
the value of a classical random variable C which takes four equiprobable values 
0,1, 2, 3. Namely 
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Note that states(26) are pure and orthogonal and, therefore, form a basis in 
Ti. — H.A ®'Hb consisting of eigenvectors of an observable 7 in 

Since (7 is a classical variable, the joint density matrix p{A, B, C) must be 
diagonal in C. In the basis {a, (3) = {|00), |01), |10), |11)} the density matrix has 
the following form: 



p{A,B,C) 



0 0 0 0 
0 110 
0 110 
0 0 0 0 



1 

8 



0 0 0 0 

0 1-10 
0-110 
0 0 0 0 



10 0 1 
0 0 0 0 
0 0 0 0 
10 0 1 



10 0-1 

0 0 0 0 

0 0 0 0 

-10 0 1 



(27) 



(The empty blocks, in the above matrix, are all zero.) 

Now it is easy to see that subsystems A and B taken separately carry no 
information about C: 



I{A; C) = I{B; C) = /(a; C) = J(/3; C) = 0 (28) 



The measurement of the observables in both A and B yields only one bit of 
information: 



I{a,(3-,C) = 1 (29) 

as it should be in the case of three random variables in classical information 
theory. However, the measurement of the variable 7 , whose eigenvectors form a 
basis in the H which is not a tensor product of bases in Ha and Hb, yields, 
paradoxically, two bits of information, thereby identifying uniquely the value of 
C: 



I{A,B;C) = I{j;C) = 2 (30) 

This is, indeed, a paradox, since in classical information theory, if a and /3 
are binary random variables, the joint information in (a, /3) about C is upper- 
bounded as follows: 



/(a, /3; C) = I{a] C) + J(/3; C\a) < I{a] C) + H{(5\a) 



(31) 
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Taking into account that H{(}\a) < H{(i) < 1, and, in our case, /(a; C) = 0, 
we obtain that /(a, (i]C) < 1. The example of superdense coding shows that the 
joint information in two two-dimensional quantum systems A and B about ran- 
dom variable C I {A, B\ C)= 2 > 1, inspite of the fact that I{A-, C) = I{B\ C) = 

0. Since in our case H{B\A) = 1, we conclude that an inequality opposite to 
(31) holds here, namely, that 

I{A, B- C) > I{A- C) + H{B\A) (32) 

which resolves the paradox. 

These unusual properties of information between quatum systems results 
from the fact that, in contrast with classical theory, this information depends on 
the choice of measurement, i.e. on the choice of a complete set of one-dimensional 
orthogonal projection operators in the Hilbert space of the system. Let us point 
out in conclusion that definitions of conditional entropy and information for 
quantum systems can be easily modified in an obvious way to include indirect 
measurements, i.e. complete sets of one-dimensional projection operators which 
form a non-orthogonal resoultion of identity in the Hilbert space. 



References 

1. O. Klein, Zeitschr. f. Phys., B.72, 1931, 767. 

2. J. von Neumann, Mathematische Grundlagen der Quantenmechanik, 
Springer- Verlag, Berlin, 1932 

3. V.V. Mityugov, Physical Principles of Information Theory^ Sovietskoye 
Radio, Moscow, 1976 (in Russian) 

4. R. Balian, From Microphysics to Macrophysics. Methods and Applications 
of Statistical Physics, Vol. 1 , Springer- Verlog, Berlin, Hiedelberg, New York, 
1991 

5. L.B. Levitin, On the Quantum Measure of Information, Proc. of the 4th 
Conf. on Information Theory, Tashkent, 1969, 111-116. English Translation: An- 
nales de Fondation Louis de Broglie, v.21, No. 3, 1996, 345-348. 

6. A.S. Holevo, Some Estimates of Information Transmitted over Quantum 
Communication Channel, Probl. of Inf. Transm., v. 9 , No. 3 , 1973, 3-11 

7. C.H. Bennett and S.J. Wiesner, Phys. Rev. Letters, v. 69, 1992, 2881 




Accessible Information in Multi-access Quantum 

Channels 



A.E. Allahverdyan and D.B. Saakian 
Yerevan Physics Institute 

Alikhanian Brothers St. 2, Yerevan 375036, Armenia 



Abstract. The accessible information in multi-access quantum channels 
are considered. Classical messages from independent sources, which are 
represented as some quantum states, are transported by a channel to one 
address. The messages can interact with each other and with external en- 
vironment. After statement of problem and proving some general results 
we investigate physically important case when information is transported 
by states of an electromagnetic field. One-way communication by noisy 
quantum channels is also considered. 



1 Introduction 

Physical ideas played important role as sources for information theory jllt)| . In- 
vestigation of quantum mechanical aspects of information theory was started by 
Gordon, Lebedev and Levitin 0, and other researchers (see references in P). 
Now it is well known that quantum-mechanical and thermodynamical limitations 
are very important for correct consideration of information-theoretical models 

More generally, quantum information theory contains two distinct types of prob- 
lems. The first type describes transmission of classical information through a 
quantum channel (the channel can be noisy or noiseless). In such scheme bits 
encoded as some quantum states and only this states or its tensor products are 
transmitted. In the second case arbitrary superposition of this states or entan- 
glement states can be transmitted. In the first case the problems can be solved 
by methods of classical information theory, but in the second case new physical 
representations are needed. In this work we investigate the problems of the first 
type. 

In the almost all paper devoted to problems of physical information theory only 
one-way communication are considered- i.e., there is one input with some initial 
quantum ensemble and one output where quantum states are detected. Here 
quantum ensemble is some set of quantum states (which can be represented by 
corresponding density matrices ) with corresponding probabilities. But in the 
practice multi-terminal communication schemes also are important. In this case 
there are several input for information and several output for detection. In this 
communication scheme messages are represented as some physical systems and 
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there are interactions between the messages and external environment. For gen- 
eral discussion about multi-terminal classical, mathematical information theory 

see [Tn| . 

The paper is organized as follows. In section 2 we discuss the general statement 
of the problem, and derive some upper bounds for the accessible information like 
Holevo bound in one-way communication ^ . In section 3 practically important 
case is considered when information is transmitted by coherent and quadrature- 
squeezed states of an electromagnetic field. 

2 The Upper Bounds for Accessible Information. 

We consider one output which receive information from two independent sources. 
The messages of these sources are represented as some quantum states. More 
exactly we can say that for any letter of the classical alphabets the concrete 
quantum state is generated. The initial quantum ensembles of two independent 
sources are 



a 


(1) 


= i: Av?'. 


(2) 



/3 



After initial preparation the quantum states of (Pi) penetrate through a quan- 
tum channel. In this channel there are interactions between states of the sources 
and an interaction with the environment. The possible concrete mechanisms of 
these interactions will be discussed later. The general effect of the noisy quan- 
tum channel can be described by a quantum evolution operator S with kraussian 
representation 

Sp='^AlpAf,, = (3) 

This operators must be linear, completely positive and trace-preserving PE!. 
After interaction, the receiver has the states 




These states are contained in the quantum ensemble 

CT = (5) 

a,/9 

At the output of the channel the receiver should separate and recognize the 
messages of each source. Besides a noise which is introduced by the environ- 
ment each transmitter also acts as a noise for other. Now the receiver needs 
some measurement procedure. It is important that the elements of (1 1 1211 can be 
nonorthogonal or nonorthogonality can occur after action of . In this case for 
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more optimal distinguishing between different quantum states we need gener- 
alized measurement procedure This type of measurement is represented by 
some nonorthogonal resolution of identity 

if-y = 1, if-y > 0 (6) 

7 

If system with density matrix p is measured then probability of the result 7 
is tr{pE^). In P| was shown that for distinguishing some nonorthogonal states 
measurements like (El more optimal than usual. At the output of the channel 
(El as result of some generalized measurement like (0 we have the conditional 
probabilities 

p(7/a/3) =tr(AyCr^^y3^^) (7) 

This procedure is called decoding. With initial and independent distributions 

P{a), p{P) (8) 

the dual multi-access channel in the classical sense is determined 1 1 l)lt)j . 

Let i?i, i?2 are the maximal quantities of information which the sources can 
transport in the regime of the reliable connection (it is the connection with 
small probability of an error in the decoding). As was shown in EUI such i?i, i?2 
must satisfied the following conditions 



i?i < /(a : 7//?), i?2 < /(/3 : 7/a), 



(9) 



Where 



Ri + R 2 < I{a (3 : 



7(a:7//3)= A t) In 



p(a, / 3 , 7) In 

q:,/3,7 



p{l/aP) 

P{l) 



( 10 ) 

( 11 ) 

( 12 ) 



The second value is usual mutual information between ensembles 7 and a® ( 3 . 
The first value is called mutual-conditional information (mc-information) . The 
mutual information of two ensembles is reduction of entropy of one ensemble if 
the other is observed. Mc-information has the same meaning but after realization 
of the conditional ensemble. The physical meaning of (0 E3) is follows. Eq. 
dmi is usual Shannon formula for joint channel, i.e. when considering the pair 
of symbols presented at both inputs as a single symbol of a larger alphabet. 
Eq. 0) arises because messages from the different inputs should be separated 
at the output. Thus the amount of information which can be transported by 
the first transmitter is limited by mutual information between the first input 
and the output when the second input is fixed, and analogously for the second 
transmitter. Now for long sequences in the inputs and memoryless external noise 
we have mc-information. 
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The problem of physical information theory in this case is investigation the 
results of dUD for physically important noisy channels. 

In general case the values ED, (112) can not be calculated explicitly. Therefore 
investigation the upper bounds for this values is an important problem. For the 
one-way communication such theorems was proved by A.Holevo 0. The most 
general results in this direction was obtained in the [^. We shall obtain the 
measurement independent upper bounds for ED, dI3 by using the methods 
of 13 . For this purpose we need some general results from quantum statistical 
physics. 

Quantum relative entropy between two density matrices pi, p 2 is defined as 
follows 

5'(pi||p 2) = tr(pilogpi - pilogp 2 )- (13) 

This positive quantity was introduced by Umegaki m and characterizes a degree 
of ’closeness’ between density matrices pi, P 2 - The properties of quantum relative 
entropy were reviewed by M.Ohya m Here we need one basic theorem which 
was proved by Lindblad m 

^(Pl||p2)>,5(5pi||5p2). (14) 

I.e. after action of S the quantum states can be only more ’close’. For any 
generalized measurement {E^} and density matrix p we define the following 
transformation 

p ^ diag{tr(F;ip),tr(£’2p), ....} (15) 

In other words p is transformed to a diagonal matrix with the corresponding 
diagonal elements. This map is also general quantum evolution operator like ®. 
Now we shall use map and theorem (ED for values like 

tr[5(pW (g)p^^^)(ln5(pi^^ -lnA(p(^) (g)p^^^))] (16) 

We get 

I{a : 7 //J) < ®p^^^)) (17) 

a,0 0 

/(a® /3 : 7) < - S{S{p^^'> ® p^^^)) + S{S{p^^'^ 0 p^^^)) (18) 

a,/3 

These inequalities are very useful if general limits are developed. Eqs. II I Yl II iSll 
are the central results of this section. The attainability of this bounds will be 
discussed elsewhere. 

3 Information Transmission by States of an 
Electromagnetic Field. 

In practice the most important tool for information transmission is the electro- 
magnetic field. We briefly recall the connection between the formalism which 
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is described above and the characteristics of the electromagnetic field in the 
vacuum or linear dielectric media Here the information is connected with 
longitudinal characteristics of a plane wave with fixed center-frequency and nar- 
row bandwidth. But transverse (polarization, wave vector) characteristics are 
fixed. This statement of question is more or less realizable in the practice. 

Two modes of the electromagnetic field with frequencies wi, u >2 penetrate into 
the noisy channel. We have considered two kinds of communication channels, 
which are coherent channel and quadrature-squeezed (briefly squeezed ) chan- 
nel. In the first case the inputs of the sources are coherent states of chosen field 
modes, and information is carried in the pattern of complex amplitude exci- 
tations. In the second case the inputs are squeezed states, and information is 
carried in the pattern of excitations of squeezed quadratures. Relative to a co- 
herent states a quadrature-squeezed states have reduced quantum uncertainty 
in one quadrature component, called the squeezed quadrature. There is a corre- 
sponding increase in the uncertainty in the orthogonal quadrature component, 
called the amplified quadrature. For the case of coherent states information can 
be recovered by heterodyne detection, i.e., by measuring both quadratures of 
the mode. For the case of quadrature-squeezed states information is recovered 
by measuring of squeezed quadrature. In the channel the modes can interact 
together and with external thermostat. We assume that the interaction between 
modes and the interaction with the thermostat are linear (for discussion about 
realization of this type of interaction see M)- 

We describe our model by quantum Langevin equation m where rotating wave 
approximation is done. 



lai = WiOi — t— 1- KQ2 + iri, 



(19) 



7^2 

10,2 = ^202 — — I- kai -I- 1 F 2 



( 20 ) 



Where oi, 02 are annihilation operators for the modes, k is the strength of 
the cross mode interaction, 7 is the damping constant (for simplicity we choose 
damping constant same for the modes), F 2 {t) are langevin forces (white 

noise). These equations are generated by Hamiltonian 



F[ = 0Jia\ai -I- u>2a\a2 + k{a\a2 + a|ai) (21) 

Solution of these equations can be obtained immediately. For example 



ai = ai( 0 )(ee*'^i‘ + (1 - e)e*^^‘) - 02(0) v"e(l - e)(e*'^^‘ - 

Fi{t')dt' + [ e*‘^=‘>2(t')*' (22) 

Jo Jo 



Where: 



, -7 X Wi -I- W 2 ± 7 (wi - UJ 2 F + ‘^k? 

01,2 — — Ai ,2 + M ,2 — 



2 



( 23 ) 
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1 - £ _ (Ai - u;i)^ 

e 



(24) 



{F^{t)Fi{t')) = 'rriT{Xi^ 2 )SijS{t - t'), riT = (exp (Ap2/T) - 1) ^ (25) 

{F,{t)F]{t')) = 7(nr(Ai,2) + - t'), {F,F,} = 0 (26) 

Now we assume that modes oi, 02 at the t = 0 (in the input of the channel) are 
in squeezed states with squeezing parameters r\, (some of these parameters 
can be zero) and shifts (oi, 02), (/3i, /32) (for definition of squeezed states see P 
) . In the output of the channel (at the moment t) one of the modes (for example 
oi) is measured. The case when two modes are measured, and decoders can 
interchange the available information can be obtained by simple modification of 
the final results. We shall consider only two type of measurements. In the case 
of heterodyning both component of the mode is measured simultaneously: P 



pij/aP) 






(27) 



Where I7) is coherent state with shift 7, and p{t; oi) is the density matrix of the 
mode oi at the moment t. In the second case the squeezed component of the 
mode is measured. In this case 



p(7i/a/3) = (7i|p(t;ai)|7i) (28) 

Where I71) is an eigenstate of the measuring component. Later we shall treat 
homodyne and heterodyne measurements simultaneously, and shall use symbol 
7 for the both cases. 

Simple analysis shows uni that capacities are maximized (the maximization 
is done by all distributions with fixed initial parameters) by gaussian input 
probabilities 

p{a) - exp {-^o^Kaa), p{/3) ~ exp (-i/3^AT^/3) (29) 

Where 

a^ = {ai,a2), = (/3i, /?2)- 

Here and in future gaussian integrals will be written up to multiplicative factor. 
The Langevin equations (1 1 1121 )» are linear and after time t a gaussian state 
remains the gaussian. It is convenient to calculate (1281271 in the formalism of 
Wigner functions. The Wigner function of general gaussian state can be repre- 
sented in the following form as function of the means and square-means of this 
state 

wm T ^ ! (R-67- (Rea))2 (Im7-(Ima))2 

W(Re 7 ,Im 7 ) ~ exp(-^ —K 22 

— (Rey — (Rea))(Im7 — (Ima))iLi2) (30) 

Where 

(KxxKx 2Y ^(cxxcx^\ .3^. 

\KX 2 K 22 ) \C 12 C 22 J ^ ’ 
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cii = ((Rea)^) — ((Rea))^ 

C22 = ((Ima)^) - ((Ima))^ 

(Realma + ImaRea) , , \ 

C12 = C21 = r (Ima) (Rea) 



( 32 ) 



As we see this channel is gaussian. Capacity region for two-terminal gaussian 
channel can be obtained exactly. If vectors a, j3 with distribution (121 are initial 
messages for the channel then we have for output vector 7 



7 — Aicn ~h A 2 P z. (33) 

Where A \ , A 2 are some matrices (we shall use the specific form of these matrices 
for homodyne and heterodyne measurements ), and z is additive gaussian noisy 
vector with correlation matrix K~^ 



{z^) = K 



-1 



(34) 



In this case we get 

1(7 ■. a® f}) = lndet(I -I- KAiK~^Aj + KA 2 K'^^A 2 ) 

1(7 : a/P) = lndet(I -|- KAiK~^Aj) 

: P/a) = lndet{l + KA 2 Kp^A 2 ) (35) 

After some not hard but tedious calculations we come to the concrete results. 
The first case is heterodyne measurement of the mode 1 at the moment t with 

ri = 0, T2 = 0. (36) 



In this case we should choose the following initial distributions 



K~^ = 



With 



X 0 

Ox ) ’ 
1 



= 



1 



y 0 

0 y 



x=^ni, y=-n2. 



(37) 



(38) 



Where ni, n 2 are mean photon number of the initial distributions. In this case 
we have from general formulas 



7(7 : a 0 /3) = In ( 1 -|- 



nie -I- 2(n2 - ni)e(l - e)e ^‘(1 - cos(t(Ai - A2))) 



i(e-7t + iZ/+l) 



7(7 : a//3) = In ( 1 -I- 



nie '>'*(1 — 2e(l — e)(l — cos(t(Ai — A2))) 

i(e-7t + iZ/ + l) 

And for ^ we have 

'7' = (1 — e "'’*)(2eh7’(Ai) -I- 2(1 — e)h7’(A2) -I- 1) 



(39) 



(40) 
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We don’t write expression for J(/3 : 7 /a) if written formulas are sufficient for 
understanding the behavior of this quantity. 

For t = 0 these information measures coincide with well known expression for 
capacity of coherent state channel. For large t (E3 tend to zero. But as we see the 
decay is not monotonic: the additional oscillation occur due to the interaction 
between different modes. If we take e = 1 we come to one-terminal channel with 
gaussian noise. In this case the capacity monotonically tends to zero. 

Now about homodyning with arbitrary r. In this case we should choose as initial 
distributions 

V =(?;;) (") 

It is product of the gaussian distribution for the measuring component and 
delta- function for the other component. With the choice of (jnj we have optimal 
distribution of the input energy. In this case we have the following connection 
between dispersion of the distribution and mean photon number 

x = hi — sh^ri, y = 712 — sh^r2. (42) 



In the first we consider one-terminal communication with arbitrary squeezing 
parameter r. In this case we have 



4 (Q! : 71) = 2 



1 + 



cos^(At)(n — sh^r) 
ie-2’--b2sin2(At)sh(2r)-bC' 



(43) 



For given n optimal squeezing parameter is determined by the following formula 

2 ^ y^cos‘*(Af) -I- -I- 4C(2h -|- 1) cos^(At) — cos^(At) 

C 

Where 

C=(e^‘-l)(2hT+l) (45) 

As we see optimal time-dependent squeezing parameter tends to zero for large 
t. It is well known that for zero t the squeezed-state channel are more effective 
than the coherent-state one p. Indeed we have 

I = ln(I -I- n) (46) 

for the coherent-state channel and 



I = ln(l -I- 2n) 



(47) 



for the squeezed-state channel. As we see noise beat usefulness of squeezed states, 

and for optimal squeezing parameter we have 

Now about general case for the homodyne measurement. We get 



I{a : 7i//3) = - In 1 -b - 



(rii — sh^ri)7 






( 48 ) 
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I{a (g) /3 : 7i) = - In 




Where 



(ni - sii^ri)ul + (ri2 - sh^r2)wi + 




ui = e '''*/^(ecos (Ait) + (1 — e) cos (A2t)) 

U 2 = — (Ait) + (1 — e) sin (A2t)) 
v\ = — e“'*'*/^\/e(l — e)(cos (Ait) — cos (A2t)) 

V 2 = e“^‘/^i/e(l - e)(sin (Ait) - sin (A2t)) (50) 

We can maximize by ri, V 2 the information measure I {a : 7//?) (as we know 
it is information transmitted by user 1), and after this information measure 
I{a ^ P : 'y) — I{a : 7//?) for user 2. The analysis show that situation with 
one-terminal channel is conserved: there are optimal ri, r 2 which tend to zero 
for large t. The second mode introduce additional source for noise. 



4 Conclusion. 

We have considered the noisy dual-access quantum-mechanical channel, and 
compute capacities of this channel. The general theorems are proved which con- 
nect capacities of the channel with some functions from statistical mechanics. It 
is shown that each user acts as noise and can significantly reduce the capacity 
of the other user. After this the information transfer by coherent and squeezed 
states of an electromagnetic field is discussed. The squeezed states loss optimal- 
ity under action of the noise and the optimal squeezing parameter tends to zero 
when time tends to infinity. 
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Abstract. We derive relation between a quantum channel’s capacity 
to convey classical information and its ability to convey quantum infor- 
mation. We also show that these properties of a quantum channel are 
related to the channel’s ability to convey quantum coherent information. 

1 Introduction 

A quantum communication channel can be used to perform a variety of tasks, 
including: 

1. Conveying classical information from a sender to a receiver. 

2. Conveying quantum information (including quantum entanglement) from a 
sender to a receiver. 

Each of these tasks can be performed in the presence of noise. Indeed, in quantum 
cryptography the noise is of central importance in revealing the activity of an 
eavesdropper. 

A central concern in the analysis of any noisy communications channel is the 
channel’s capacity, the maximum rate at which information (classical or quan- 
tum) can be reliably transmitted through the channel. When we use quantum 
systems to convey information we confront concerns which are not present in 
classical channels. A chief concern is the measurement performed by the receiver 
in order to extract the information. 

Measurement of the received message is not particular to quantum channels; 
a receiver using a classical channel must also measure the received message. 
Counting the numbers of dots and dashes in a Morse codeword is a measure- 
ment. What is a particular concern for the user of a quantum channel is that 
a measurement result may be ambiguous even if the quantum channel is noise- 
less. For example, assume that Alice, the sender and Bob, the receiver, agree 
to use vertically polarized photons to represents a “1” and horizontally polar- 
ized photons to represent a “0” . We also assume that Bob measures for photons 
polarized along an axis that is 45 degrees from horizontal. Quantum mechanics 
tells us that, in this case. Bob could not tell if Alice sent a “1” or a “0” . This 
is true even if Alice’s photon reaches Bob without ant distortion in its angle 
of polarization. One might object (quite rightfully in this case) that Alice and 
Bob settled on a particularly silly measurement to use in reading the signals. 
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If the letter states (photon polarizations here) are not orthogonal then there is 
no measurement Bob can perform that will perfectly distinguish them. This is 
a first difference between classical and quantum channels: measurements are, in 
general, ambiguous, even if no noise is present. 

One might suggest that if Bob cannot use a single measurement to decode 
Alice’s signal then Bob should perform several measurements on each photon. 
Quantum mechanics prevents implementation of such a scheme: a measurement 
of a quantum system fundamentally changes the properties of that system. We 
return to our example where Alice and Bob used vertical and horizontal polariza- 
tions and a measurement at 45 degrees. After Bob has measured a given photon, 
it is the in a pure state of 45 degree polarization. That is, if a second polarizer 
is set up immediately after the first with the same (45 degree) polarization then 
the photon will pass it with certainty. If the polarizer is set at any other angle, 
then there is a nonzero chance that the particle will not pass. Indeed, even if 
Alice dispatches the photon with vertical polarization and if it passes Bob’s first 
polarizer, it will then have only a 50% chance of passing a subsequent vertical 
polarizer. This is a second difference between classical and quantum channels: 
measurements of the received signals fundamentally change the signal. 

One might further suggest that if Bob’s measurements change the state of the 
transmitted systems, then Bob should make several copies of each signal state. 
He could the perform a single measurement on each copy and thereby extract 
Alice’s message. Once again, quantum mechanics prevents such a solution. The 
problem here is that general quantum states can not be perfectly copied. This is 
the content of the no-cloning theorem of Wootters and Zurek |3| . This is a third 
difference between quantum and classical channels: quantum states cannot be 
cloned. 

It should be noted that the no-cloning theorem does not imply that Alice can 
not make multiple copies of the states she is sending. All that she needs to do is 
prepare multiple systems in the same way; this is not cloning. If Alice and Bob 
decide to do this to increase the reliability of their channel it would decrease the 
capacity of the channel. This is because capacity is the rate at which information 
is transmitted per letter state. If Alice send more letter states, the capacity of 
the channel is reduced. 

Given what has been said to this point, one might conclude that even finding 
the classical capacity of a quantum channel is problematic at best. In fact, we 
do know the classical capacity, even for noisy quantum channels. This result is 
presented in Section II. 

To this point, we have been concerned with the classical capacity of a quan- 
tum channel. One can also analyze the quantum capacity of a quantum com- 
munications channel. This is the ability of the channel to faithfully transmit a 
quantum state from one system to another. The capacity of a noiseless quantum 
channel is known HD but the case of the noisy channel is still under analysis 
even though some progress has been made |ZQ|. Section III presents the concept 
of coherent quantum information HH| and some of its properties. Section III 
concludes with an anlysis showing the relation between the classical capacity of 
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a channel, the coherent information conveyed by that channel and the quantum 
capacity of the channel. 



2 Classical Capacity of a Noisy Quantum Channel 

Suppose Alice wishes to convey classical information to Bob by using a quantum 
system Q as a communication channel. Alice prepares the channel in one of vari- 
ous quantum states Wx with a priori probabilities Px ■ Bob makes a measurement 
on the system Q, and from its result he tries to infer which state Alice prepared. 
A theorem stated by Gordon P and Levitin jSj, first proved by Kholevo |0|, 
gives an upper bound to the amount of information that Bob can obtain about 
Alice’s signal. If W = J2xPx^x is the density operator describing the ensemble 
of Alice’s signals, then the mutual information H{X : Y) between Alice’s input 
X and Bob’s output Y is bounded by 

H{X :Y)<H{W)-Y,PxH{Wx), (1) 

X 

where H{W) = — Tr VLlog W, the von Neumann entropy of the density operator 
W. The upper bound in Equation Q is in general a weak one, in that Bob may 
not be able to choose an observable that gives him an amount of information 
near to the upper bound p. 

Suppose that Alice employs signal states Wx that are mixed states. Then can 
Alice and Bob find a choice of code and decoding observable so that the general 
Levitin - Kholevo bound (Equation QJ can be approached arbitrarily closely? In 
this paper, we show that the answer to this question is “yes” . That is, we prove 
the following result: 

Theorem. Suppose we have letter states Wx with a priori probabilities 
Px^ and let 

X = H{W)-Y^PxH{Wx). 

X 

Fix e,(5 > 0. Then for sufficiently large L, there exist a code (whose 
codewords are strings of L letters) and a decoding observable such that 
the information carried per letter is at least x~ ^ the probability of 
error Pe < e. 

The proof employs an average over randomly generated codes to establish the 
existence of a satisfactory code. (If the average probability of error is small for an 
ensemble of codes, the ensemble must contain specific codes with small proba- 
bility of error.) We also use a similar prescription for Bob’s decoding observable. 
The chief refinement in the proof presented here is the enforcement of stronger 
“typicality” conditions on various quantities associated with the channel. 

The mixed states Wx may be thought of as the outputs of a noisy quantum 
channel. Thus, our main result will enable us to draw conclusions about the 
classical information capacity of a noisy quantum channel. 
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We have shown that it is possible to send information at any rate up to y 
bits per letter with arbitrarily low probability of error. The capacity of a channel 
is defined as the maximum information per letter that may be sent through the 
channel with Pe arbitrarily small. Thus, x provides a lower bound to the capacity 
of the quantum channel. 

Classical information theory together with the Levitin - Kholevo Theorem 
also allows us to use x to establish an upper bound for the capacity of the 
channel. Suppose X represents Alice’s input and Y represents Bob’s decoding 
measurement outcome. Then the Fano inequality uni states that 

- PeIoePe - (1 - Pi3)log(l - Pe) + PslogiNx - 1) > H{X\Y) (2) 

where Pe is the probability of error and Nx is the number of possible values 
of X. H{X\Y) is the conditional Shannon entropy of X given Y — that is, the 
entropy of the conditional distribution p(x\y), averaged over the various values 
of y. It is related to the mutual information H{X : Y) by 

H{X\Y) = H{X) - H{X :Y). (3) 

In the channel, Alice uses some signal states pa with probabilities Pa- Levitin 
- Kholevo Theorem places an upper bound on the mutual information H{X : Y): 

H{X-.Y)<H{p)-Y,PaH{pa). 

a 

(Note that, if the channel used by Alice and Bob consists of L letters used 
independently, then the Levitin - Kholevo bound is just L x, where x is tbe 
Levitin - Kholevo bound for a single letter.) If the Alice’s input X has an entropy 
H{X) that exceeds H{p) — J2aPaH{Pa), then H{X\Y) > 0 and it will not be 
possible to make the probability of error Pe arbitrarily small. 

Suppose we fix an alphabet P = {Wx} of letter states Wx, and require 
that Alice use codewords a that are length-L strings of these letter states: 
a = xi-.-Xl- Then the probability distribution Pa yields marginal probabil- 
ity distributions p{xi), . . . ,p{xl) and average density operators Wi , . . . , Wl for 
the L different letters. It follows that 

H{p) -Y,PaH{pa) < {h{W^) ~Y,P{^l)H{Wx,) 

a \ x\ 

+ • • • + - ^p{xe)H{Wx,)^ (4) 

where we have used the subadditivity of the entropy H{p). We might write this 
as 

< Xl H 'rXL (5) 

where x*'^^ represents the Levitin - Kholevo bound for the ensemble of codewords 
of length L, and xi, ■ • ■ , Xi represent Levitin - Kholevo bounds for the individual 
letter ensembles. 
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We define the fixed- alphabet capacity Cr to be 

Cr = sup X (6) 

p[x) 

where p{x) is the probability distribution over the letters states in F and x is 
the single-letter Levitin - Kholevo bound. This quantity represents the maxi- 
mum information rate per letter that Alice can send to Bob with arbitrarily low 
probability of error. 

This claim follows directly from our results so far. Suppose Alice uses code- 
words of length L. Then x^^^ ^ LCp', and by the above argument, if Alice 
attempts to send more than LCr bits using these codewords then the proba- 
bility of error will not be arbitrarily small. Conversely, we can choose the letter 
probabilities so that x is as close as required to Cr, and we have previously 
shown that a suitable choice of code and decoding observable can convey up 
to X bits per letter with arbitrarily low Pe- Thus, the capacity Cr cannot be 
exceeded but can be approached arbitrarily closely. 

The mixed states Wx used in our alphabet are the states available to Bob for 
decoding. They may in fact not be the original states of the channel Q chosen by 
Alice. In the interval between Alice’s encoding and Bob’s decoding, the system 
Q may have undergone unitary internal evolution (which Bob can correct by 
a suitable choice of “rotated” decoding observable) and interaction with the 
external environment (which Bob cannot in general correct). 

The most general description of the evolution of a quantum system Q inter- 
acting with an environment is provided by a trace-preserving completely positive 
linear map on the set of density operators of Q Such a map is described by 
a superoperator £ : 

P — ^P' = £{p), (7) 

where p is the initial state of the system and p' is the final state. The super- 
operator £ acts linearly, so that a convex combination of input states yields a 
convex combination of output states. This description clearly includes unitary 
evolution of Q as a special case, but it also can account for interaction with the 
environment. 

A noisy quantum channel is defined by a superoperator £ that describes the 
evolution of each letter as it is transmitted from Alice to Bob. We assume that 
the channel is memoryless — i.e., that the evolution of each letter is independent. 
This means, among other things, that a product state of several input letters 
will evolve into a product state output. 

Alice’s basic problem is to use input states Wx so that the output states 
Wx = £{wx) can be distinguished by Bob. If Alice has a fixed alphabet {tCa,} 
of input states, then the maximum achievable information rate per letter is still 
given by our fixed-alphabet capacity Cr, where F is the alphabet of output 
states. 

Now suppose that Alice is allowed to choose her input states in order to max- 
imize the information conveyed to Bob over the noisy quantum channel, subject 
to the constraint that Alice must transmit codewords which are represented by 
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product states of the letters. This almost reduces to the fixed-alphabet problem, 
where the fixed alphabet T now includes all of the possible output states of the 
channel. The maximum over probability distributions is now a maximum over 
all input ensembles of states chosen by Alice. 

We say that this problem almost reduces to the fixed alphabet problem in 
that the argument that x is an upper bound of the capacity must be modified 
in this case. Recall from the previous section that we applied the classical Fano 
inequality to show that if Alice attempts to send information at a rate exceeding 
y then the probability of error cannot be made arbitrarily small. If we attempt 
to use the same argument in the present case then the Fano inequality does not 
help us for at least two reasons. First, the number of possible input states 
is unbounded. Second, we do not have a characterization of H{XlineY) that 
allows us to compare it with N^- Thus we will modify the Fano inequality to 
understand the behavior of the probability of error in the present case. 

We first note that the probability of “getting it right” 



1-Pe 




ak 



( 8 ) 



is linear in the elements of the POM. Thus the probability of error, Pe is a convex 
function on the elements of the POM. We may modify the proof of a result of 
Davies (Theorem 3 of ^2j) to show that the convex function Pe is minimized 
by a POM having no more than (P elements, where d is the dimension of the 
support of the POM. Thus, the probability of error is minimized by a decision 
scheme in which at most of the inputs are identified by the decision scheme. 
Let us denote the output of such a scheme by Pmm- Fano’s inequality gives us 
that 

- PeIoePe - (1 - PB)log(l - PE)+PElog{d^ - 1) > H{X\Y^,n). (9) 

Note that 

H{X\Ym) = H{X) - H{X : Ymmin) (10) 

>H{X)-x, (11) 

so that we conclude 

- Pe log Pe-{1- Pe) log(l - Pe) + Pe log(d' - 1) > H{X) - y. (12) 



Note that this is a relation between the minimum probability of error and a 
quantity (H(X) — y) which does not depend on the particular decision scheme. 
We see that if Alice attempts to send information at a rate H{X) in excess of y 
then the probability of error can not be made arbitrarily small. 

We now turn to a demonstration that this rate can be achieved. Alice wishes 
to choose a set of input states Wx (together with input probabilities Px) so that 
y is maximized for the output states Wx- We next show that Alice can do no 
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better than choose the input states Wx to be pure. Let a set of (possibly mixed) 
input states Wx be given along with their a priori probabilities, and let 

W = '^PxWx = '^Px£{Wx) (13) 

X X 

be the average output state. Then 

X = H{W)-Y^PxH{S{wx)). (14) 

X 

Construct a new set of pure state inputs by resolving each mixed state input 
into a convex combination of pure states: 

'^x — ^xk I '^xk) i'^xk I • (1^) 

We will use the state | ipxk) with probability Pxk = Px ^xk- By linearity, 

Wx = £{Wx) = '^XxkS{\i’xk) {lpxk\), (16) 

k 

SO that the average output state is still W, as before. By the convexity of the 
von Neumann entropy, 

i^(W"x) >^A,fcif(£:(|V’xfc)(V'.fc|)- (17) 

k 



It follows that 



x' = H{W) - '^PxkH{S{\ ipxk) {i’xk I) 

xk 

>H{W)-Y,P<^H{S{wx)) = X- (18) 

X 

In other words, for any ensemble of mixed input states, we can find an ensemble 
of pure input states whose output states have a x at least as great. The optimal 
inputs for the noisy quantum channel are pure states. 

To sum up, if Alice is required to use product states to represent her code- 
words, then the capacity of the noisy quantum channel is 

( 7 ( 1 ) 

= maxx (19) 

where x is the Levitin - Kholevo bound for the output states of the channel, 
and the maximum is taken over all ensembles of pure state inputs. Alice can 
reliably transmit information to Bob at any rate below We will refer to 

(7(1) 

as the product state capacity. The superscript (1) reminds us that Alice is 
required to use the multiple available copies of the channel one at a time, coding 
her messages into product states. 
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3 Coherent Quantum Information and Quantum 
Capacity 

The entropy exchange Se measures the amount of information that is exchanged 
between the system Q and the environment E during their interaction. If the 
environment is initially in a pure state, the entropy exchange is just the envi- 
ronment’s entropy after the interaction — i.e., Se = S{p^ ), where is the final 
state of E. (The entropy here is just the ordinary von Neumann entropy of a 
density operator, S{p) = — Trplogp.) The entropy exchange is entirely deter- 
mined by the initial state of Q and the channel dynamics superoperator 
that is, the entropy exchange is a property “intrinsic” to Q and its dynamics. 
The coherent information Jg, introduced in m, is given by 

Ie = S{pQ')-Se. ( 20 ) 

The coherent information has many properties that suggest it as the proper 
measure of the quantum information conveyed from Alice to Bob by the channel. 
For example, le can never be increased by quantum data processing performed 
by Bob on the channel output, and perfect quantum error correction of the 
channel output is possible for Bob if and only if no coherent information is lost 
in the channel m- Finally, the coherent information seems to be related to the 
capacity of a quantum channel to convey quantum states with high fidelity m- 
Alice might be using the channel to send classical information to Bob. Alice 
prepares Q in one of a set of possible “signal states” p^ , which are used by Alice 
with a priori probabilities pk- The average state p^ is given by 

P^ = ^Pkp2- ( 21 ) 

k 

Bob receives the fcth signal as pf = £^{pf). Because the superoperator is linear, 
the average received state is 

p<3'=^Pfc£0(pQ)=£:0(p0). (22) 

k 

Bob attempts to decode Alice’s message (that is, to identify which signal state 
was chosen by Alice) by measuring some decoding observable on his received 
system Q' . 

The amount of classical information conveyed from Alice to Bob, which we 
will denote HBob, is governed by the quantity , defined by 

X^' =S{p^')-Y.PuS{p2'). 

k 



( 23 ) 
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This quantity is significant in two ways: 

— Hsob < j regardless of the decoding observable chosen [21I22| . 

^ Hsob can be made as close as desired to to x^ by a suitable choice of code 
and decoding observable. To make Hsob near ? Alice must in general use 
the channel many times and employ code words composed of many signals; 
Bob must perform his decoding measurement on entire code words. The net 
result is that the channel is used N times to send up to bits of classical 
information reliably | 2 |. 

In short, represents an upper bound on the classical information conveyed 
from Alice to Bob, an upper bound that may be approached arbitrarily closely 
if Alice and Bob use the channel efficiently. 

If this general picture is used to describe a noisy quantum channel, then we 
need to account for the information that is passed to the environment. Recall 
that the evolution superoperator describes all of the effects of the channel; 
or, to put it another way, all of properties of the link between Alice and Bob 
are contained in the interaction operator . The information passed to the 
environment He will be limited by 

x^' =S{p^')-Y,PkS{p^'). (24) 

k 



Assume that the states of Q initially prepared by Alice are pure states 
also recall that the environment E can be presumed to begin in a pure state | 0^). 
After Q and E interact unitarily, the joint state ^ ^ ® | 0^) 

will also be a pure state, generally an entangled one. The subsystem states, 
described by density operators 



Q' 

Pk 


II 






pf 


<y 

II 


^QE') 


(g,QE' 



(25) 



will have exactly the same non-zero eigenvalues, so that S{p‘^ ) = S{p^ ). There- 
fore 



iQ = S{p^')-Se 
= S{p^')-S{p^') 

= S{p^') - Y^PkSipf) - S{p^') +Y,PkS{pf) 

k k 

jQ^^Q'-X^'. (26) 

It is interesting to note that, although both and x^ depend on the choice 
of pure state inputs for the channel Q, the difference x^ ~ X^ depends only on 
the overall density operator p^ for the inputs. 
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In is shown in that perfect error correction is possible if and only if the 
coherent information of the channel equals the entropy of the input state. The 
quantity De is defined m as follows: 

De = S{p^) (27) 

We can thus say that perfect error correction is possible if and only if De = 0. 

Subtracting each side of equation (26) from the entropy of the input state 
yields: 

S{p^) - iQ = D, = S{p^) - (28) 

Recall that is maximized when Alice uses pure state to encode her messages. 
In that case we have = S{p'^), so equation (28) takes the form 

D, = x"^ - + (29) 

This equation is quite informative. It implies that conveying quantum infor- 
mation perfectly depends on two tasks: Maximization of classical capacity and 
zero entropy loss to the environment. This implies a strong connection between 
the quantum capacity of a quantum channel and its classical capacity. 

We would like to thank W. K. Wootters and M. A. Nielsen for helpful con- 
versations and suggestions. 
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Abstract. Strengthened Lindblad inequality has been proved. We have 
applied this results for proving a generalized iL-theorem in non equi- 
librium thermodynamics. Information processing also can be considered 
as some thermodynamically process. From this point of view we have 
proved strengthened data processing inequality in quantum information 
theory. 



There are close connection between statistical thermodynamics and informa- 
tion theory m- It is well known that physical ideas played an important role as 
sources of information theory P|. In the other hand, the concept of information 
is crucial for understanding some important physical problems such as Maxwell 
’’demon” ^ or general problem of quantum correlation between two quantum 
systems [S]. 

In this paper we concentrated on the two connected problems. There are the H- 
theorem problem in non equilibrium thermodynamics and problem of quantum 
data processing in quantum information theory. 

Suppose that a quantum system is described by density matrix p{t) at the mo- 
ment t. In the general case evolution of the non equilibrium system in markoffian 
regime is described by some general quantum evolution operator 

S{t',t)p{t) = p{t') (1) 

This most general deterministic quantum evolution operator S has kraussian 
representation 

Sp = Y^AlpA„ Y.A,Al = i. (2) 

fl fl 

This operators must be linear, completely positive and trace-preserving IfHHI . 
The 0 contains unitary evolution, nonselective measurement, partial trace, et 
all. If the evolution of the system is in the stationary markofhan regime then 

= S{t' -t) (3) 

Stationary markoffian regime is reasonable conjecture if the system is not far 
from equilibrium m 
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When the system is open we need some values, which must characterize the 
degree of nonunitarity for evolution of our system. The usual characteristics for 
this case is entropy. This quantity was introduced in quantum statistical physics 
by J. von Newmann. 

'S'(p) = -trplnp (4) 

As we will see later for some more general problems we need a more general quan- 
tity. Quantum relative entropy between two density matrices pi, p 2 is defined as 
follows 

5'(pi||p 2) = tr(pilogpi - pilogp 2 )- (5) 

This positive quantity was introduced by Umegaki H2) and characterizes the de- 
gree of ’closeness’ of density matrices pi, p 2 - The properties of quantum relative 
information were reviewed by M.Ohya m Here only two basic properties are 
mentioned. 

5'(pi||p 2) > S'(5pi||5p2)- (6) 

S{Xpi -I- (1 - \)ai\\\p2 + (1 - A)ct 2) < AS'(pillcri) -I- (1 - A)S'(p2||ct2). (7) 

Where 0 < A < 1. The first inequality was proved by Lindblad H2]. 

When we have a usual iJ-theorem? If m has a stationary density matrix pst 
and this pst equal unit matrix (up to unessential normalization constant), then 

5(t'-f)l = l (8) 

for any t' — t. The condition pst ~ 1 is a usual microcanonical distribution in 
equilibrium statistical physics. Then from (01 we have 

S{Sp) > S{p) (9) 

In general case (when pst / 1, it is usual case in the theory of open systems) 
entropy of von Newmann is not monotonically increasing function with time 
and for some open system can exhibit periodic behavior M In general case the 
author of 0 proposed to use as measure of nonunitarity of evolution the relative 
entropy between pst, p- Indeed, if we define the entropy of our system at the 
time t as 

- S{p{t)\\pst) (10) 

then from (0 we see that for any type of evolution II 1 1 III is monotonic function 
with time. Furthermore (mu has all physically important properties of usual en- 
tropy 0: it is positive, additive and convex HH. 

Now the following question arises. Can we generalize the ( 0 ) without any re- 
strictions? If the answer is yes, then we can prove with this result more general 
iJ-theorem. Let us assume in formula (0 that 

S = cCi + {1 - c)C2, (11) 

where Ci is defined by kraussian representation = |p)(0|, (p|/i) = (0|0) = 

1, 0 < c < 1. In other words for any operator p: C\p = |0)(0|. Now from ( 0 ), 
m we get 
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= S{cCip+{l - c)C 2 p\\cCia + (1 - c)C 2 o) 

< cS{Cip\\Cia) + (1 - c)S{C 2 p\\C 2 <y) < (1 - c)5(p||a). (12) 

We see that if S is represented in the form mu the ordinary Lindblad inequality 
can be strengthened. 

Now we need some general results from theory of linear operators !IZ|. Let two 
hermitian operators A and B have the spectrums a\ < ... < an, < bn. 

For the spectrum ci < . . . < c„ of the operator C = A + B we have 

a± “t“ 6/e ^ C}i ^ 6/e “t“ an, 6i A u/e A C}i A a]^ A bn- 

where k = 1, If 

p' = Sp = cCip A (1 — c)C 2 P 
= c|0)(0| A(l-c)a, (14) 

and Pi < ... < p'n, (Ji < ... < Un are the spectrums of p', ct then we have 

p'l - c < cri(l - c) < TAm{p'i,p'n - c), 

max(p) , p'/e - c) < CT/e (1 - c) < p'/e , (15) 

where k = 2, ...,n. We define c{S,p) as the minimal eigenvalue of p' and c{S) = 
miiip c{S,p) where minimization is taken by all density matrices for the fixed 
Hilbert space. With the well known results of operator theory ini we can write 

c(5) = min min {ip\SpM, (16) 

P <p|p)=i 

where the second minimization is taken by all normal vectors in the Hilbert 
space. For any density matrix p we get to the formula d) where c is defined in 
m and C 2 is some general evolution operator. Now from we get the 

strengthened Lindblad inequality 

(1-C)5(pi||p2)>5(5pi||5p2). (17) 

The equations dSI), (d are our general results. Of course there are many evo- 
lution operators S with c(S') = 0 but later we shall show that our results can be 
nontrivial because for some simple but physically important case c{S) is nonzero. 
From (PJ, (CD we immediately get to strengthened i7-theorems. 

Now about application of this result in quantum data processing. 

Quantum information theory is a new field with potential applications for the 
conceptual foundation of quantum mechanics. It appears to be the basis for a 
proper understanding of the emerging fields of quantum computation, commu- 
nication and cryptography (see 0 for references). Quantum information the- 
ory concerned with quantum bits (qubits) rather than bits. Qubits can exist 
in superposition or entanglement states with other qubits, a notion completely 
inaccessible for classical mechanics. More general, quantum information theory 
contains two distinct types of problem. The first type describes transmission of 
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classical information through a quantum channel (the channel can be noisy or 
noiseless). In such scheme bits encoded as some quantum states and only this 
states or its tensor products are transmitted. In the second case arbitrary super- 
position of this states or entanglement states are transmitted. In the first case 
the problems can be solved by methods of classical information theory, but in 
the second case new physical representations are needed. 

Mutual information is the most important ingredient of information theory. In 
classical theory this value was introduced by C. Shannon m- The mutual in- 
formation between two ensembles of random variables X, Y (for example this 
ensembles can be input and output for a noisy channel) 

I{X,Y)=H{Y)-H{Y/X), (18) 

is the decrease of the entropy of X due to the knowledge about Y, and conversely 
with interchanging X and Y. Here H{Y) and H{Y/X) are Shannon entropy and 
mutual entropy m- 

Mutual information in the quantum case must take into account the specific 
character of the quantum information as it is described above. The reasonable 
definition of this quantity was first introduced by S. Lloyd 0, and independently 
by B. Schumacher and M.P.Nielsen 0. Suppose a quantum system with density 
matrix 

i i 

We only assume that (ipilipi) = 1 and the states may be non orthogonal. The 
noisy quantum channel can be described by some general quantum evaluation 
operator S. 

As follows from definition of quantum information transmission, a possible dis- 
tortion of entanglement of p must be taken into account. In other words definition 
of mutual quantum information must contain the possible distortion of relative 
phases of quantum ensemble {|V'i)}- Mutual quantum information is defined as 

03 

/(p; 5) = S{Sp) - S{i^ 0 (20) 

^ I 0 ^(|V’.)(^,I). (21) 

Where S{p) is the entropy of von Newman and is a purification of p 

IV'"') = E ® (22) 

i 

tiR\'tp^){'tp^\ = p, (23) 

here is some orthonormal set. The definition is independent from con- 

crete choice of this set jSI. The mutual quantum information is the decrease of 
the entropy after acting of S due to the possible distortion of entanglement state. 
This quantity is not symmetric with respect to interchanging of input and out- 
put and can be positive, negative or zero in contrast with the Shannon mutual 
information in classical theory. 
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It has been shown that can be the upper bound of the capacity of a quantum 
channel IBIIHI- Using this value the authors |H| have been proved the converse 
coding theorem for quantum source with respect to the entanglement fidelity |^1 • 
Only this fidelity is adequate for quantum data transmission or compression. 

In the 13 the authors prove data processing inequality 

I{p-,S,)>I{p-S 2 Si). (24) 

In |BI we found the alternative derivation of this result which is more simple than 
derivation of [Z). In this paper we show that this equation can be strengthened. 
Data processing inequality is very important property of mutual information. 
This is an effective tool for proving general results and the first step toward 
identification a physical quantity as mutual information. 

Now we brief recall the derivation of data processing inequality in the general 
case. The formalism of relative quantum entropy is very useful in this context 
fIIII3| . 

We have 



S{i^ (g) 5(|V’^)(V’^|)||P 0 S{p^ 0 p)) 

= -s{i^ 0 + s{Sp)). (25) 

Here 

= KV'ilV’i)- (26) 

Now from Lindblad inequality we have 

s{i^ 0 5(iV'^)(V'^i)iir« (g) s{p^ 0 p)) 

> 5(1« 0 5i52(|V'^)(V'^|)||1« 0 0 p))- (27) 

From this formula we have ( 123 ). From ( 123 ) we have 

I{p; cSi + (1 - c)^ 2 ) < clip; Si) + (1 - c)I{p; S 2 ) 

This theorem have been proved in US). 

Now we can prove the strengthened data processing inequality. Let in ii'Z'd) S 2 is 
represented in the form (El). From (iHim we get 

Sii^ 0 S2Sii\i^^){i^^\)\\i^ 0 S2Siip^ 0 p)) 

< -(1 - c)S{i^ 0 C 2 Sii\i;^){i;^\)) + S{p^) + (1 - c)S{C 2 Sp)). (28) 

And we have 

(1-c(52))J(p;^i)>/(p;525i). (29) 

Now we consider the simplest example of noisy quantum channel: Two dimen- 
sional, two- Pauli channel uni 



Ai = i/xi, A 2 = y(l - x)/2tTi, A 3 = - a:)/2(T2, 0 < x < 1, (30) 
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where 1, cti, tT 2 are the unit matrix and the first and the second Pauli matrices. 
The ® has also physical meaning as evolution operator for two-dimensional 
open system. 

Any density matrix in two-dimensional Hilbert space can be represented in the 
Bloch form 

p=(l + a^)/2, (31) 

where a is a real vector with \a\ < 1. Now we have 

S'tp(( 1 -h acr)/2) = (1 -h 6cr)/2, (32) 

where b = (oia:, 02 a;, aa(2a: — 1)). After simple calculations we get 

c{STp) = il-\2x-l\)/2. (33) 

We conclude by reiterating the main results: Lindblad inequality can be gener- 
alized. We have results not only about increasing of the entropy and decreasing 
of the mutual quantum information but also about velocity of these processes. 
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Abstract. Instead of a quantum computer where the fundamental units 
are 2-dimensional qubits, we can consider a quantum computer made up 
of d-dimensional systems. There is a straightforward generalization of the 
class of stabilizer codes to d-dimensional systems, and I will discuss the 
theory of fault-tolerant computation using such codes. I prove that uni- 
versal fault-tolerant computation is possible with any higher-dimensional 
stabilizer code for prime d. 



1 Introduction 



Quantum computation and quantum communications have the potential to ac- 
complish many things that would be difficult or impossible using just classical 
computers and communications. However, quantum data is very vulnerable to 
decoherence and to errors. It is likely that some form of quantum error cor- 
rection will be needed to perform anything beyond the simplest computations 
with a quantum computer. Quantum error-correcting codes [1 provide 

one of the tools necessary. Such a code can protect quantum data against errors 
occuring during transmission or storage of the data. However, to have a reli- 
able quantum computer, we also need for the computation to be performed in 
a fault-tolerant manner p]. Fault-tolerant quantum computation requires a pro- 
tocol that not only maps states of a quantum code to other states of a quantum 
code, but prevents errors from propagating out of control. 

A large group of useful codes was introduced in |3| and ^ . These codes are 
essentially the quantum equivalent of classical linear codes, in that they can be 
easily described and encoded, and that it is easy to measure the error syndrome 
(as in the classical case, it may be difficult to compute the actual error from the 
error syndrome). The primary complication involved in quantum error correction 
is that it is not only necessary to correct bit flip errors 



but also phase errors 




( 1 ) 

( 2 ) 



C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 302-^3 1999. 
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Consequently, it turns out to be useful to look at the group V generated by 
tensor products of these operators. V is called the Pauli group or the extra- 
special group. Stabilizer codes are those codes where the valid codewords are all 
eigenstates of n—k operators in V. These n—k operators generate a 2"“^ element 
Abelian group, called the stabilizer S of the code. The set of valid codewords 
forms a 2 ^-dimensional subspace of the full n-qubit Hilbert space, the coding 
space of the code. The stabilizer is analogous to the parity check matrix of a 
classical linear code. In fact, the set of classical linear binary codes is exactly 
the set of stabilizer codes where everything in the stabilizer is a tensor product 
of Z operators. 

Stabilizer codes are easy to work with because of the structure of the Pauli 
group. 

AZ= -ZA , (3) 

so any two operators in V either commute or they anticommute. If an operator 
E anticommutes with an operator G in the stabilizer S' of a code, then E\^p) will 
have eigenvalue —1 for G instead of -1-1. Therefore, by measuring the eigenvalues 
of the n — k generators of S, we can easily measure the error syndrome, and if 
the code is suitably chosen, identify the error. 

Even if we restrict our attention to stabilizer codes, it is far from obvious 
that we can perform fault-tolerant computation. First of all, we must find some 
operators that map the coding space of the code into itself. Secondly, many of 
these operators will cause errors to spread from one qubit in a block to a different 
qubit in the same block of the code. Therefore, a single error could rapidly grow 
to become many errors within a block, exceeding the code’s capability to correct 
them. In order to avoid this, we will further restrict attention to transversal 
operations, that is, operations which only interact qubits from one block with 
corresponding qubits in other blocks. This means that an error occuring in qubit 
number 3 in one block might spread to qubit number 3 in another block, but 
it will never spread back to qubit 2 in the first block. Each block can easily 
correct a single error, so this situation causes no problems. Shor was the first to 
demonstrate a universal set of fault-tolerant gates jOj. However, his construction 
only worked for a small class of codes. In 0, I was able to show that Shor’s 
construction could be generalized to any qubit stabilizer code. The proof made 
extensive use of the group of unitary operators that leave the group V invariant 
under conjugation. This group is known as the Clifford group. 

In the classical theory of error-correcting codes, it is often helpful to go 
beyond bits and work with higher-dimensional systems. The same may be true for 
quantum error correction. Instead of a system made up of 2-dimensional qubits, 
we can work with a system composed of d-dimensional qudits. It turns out that 
there is a natural generalization of stabilizer codes to higher-dimensional systems 
pMTTij . In Sec. 0 I will present this generalization. Then I will precede to 
generalize the arguments of |Z] to show that universal fault-tolerant computation 
is also possible with any of these codes, at least in the case where d is prime. 
Though I will largely focus on prime d, I will also say a little bit about the general 
case along the way. Assume that d is prime unless it is otherwise specified. 
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The construction of a universal set of gates used in [3 consisted of a number 
of steps: 

1. The full Clifford group can be constructed given the controlled-NOT, oper- 
ators in the Pauli group, and measurement of operators in the Pauli group. 

2. For any stabilizer code, we can perform encoded versions of all operators in 
the Pauli group and can measure operators in the Pauli group. 

3. We can perform a CNOT between corresponding encoded qubits in different 
blocks of the code. 

4. We can swap individual encoded qubits from one block of the code to an 
empty block and vice-versa. 

5. We can move an encoded qubit in an otherwise empty block to whatever 
position in the block we desire. At this point, we have established that we 
can perform a CNOT between any pair of encoded qubits in the same or 
different blocks, and therefore can perform the full Clifford group. 

6. Given the full Clifford group, we can perform an additional gate outside the 
Clifford group, such as the Toffoli gate or the tt/S rotation. This completes 
the universal set of gates. 

Each of the steps in this construction has an analog for higher-dimensional sys- 
tems. However, in this paper, I will present a simplified construction that com- 
bines steps 0 through 0 This construction gives a direct way to perform the 
generalization of the CNOT between any pair of encoded qudits, whether they 
are in the same or different blocks and whether they are in corresponding or 
different positions within their blocks. This construction can also be used to 
simplify the proof for systems with d = 2, and will likely reduce the required 
overhead for fault-tolerant computation using codes with many qudits per block. 

2 Higher Dimensional Generalization of the Pauli Group 
and Other Structures 

The Pauli group has a natural generalization to higher-dimensional systemsQ 
Instead of generating it from the two-dimensional X and Z, we instead generate 
V from tensor products of Xd and Zd, where Xd\j) = \j -I- 1) and Zd\j) = uj^\j), 
where w is a primitive d-th root of unity. Xd and Zd satisfy the relation 



Both Xd and Zd have order d. The elements of the single-qudit Pauli group have 
the form uj°‘X^Z^, where 0 < r, s < d, and 




( 4 ) 



{X^dZ2 ■ ( 5 ) 



( 5 ) 



^ The group presented in this section is not the only generalization of the Pauli group, 
although it is probably the simplest. See jH] for a more extensive discussion of this 
issue. 
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V for n qubits will contain elements, plus an additional factor of d for overall 
phase. The elements of V have eigenvalues w’’ for r — 0, ... ,d— From now 



on, I will suppress the subscript d, and all operations should be taken to be over 
qudits instead of qubits. 

The d-dimensional generalization of the stabilizer S' of a code is again just an 
Abelian subgroup of V. The coding space is composed of those states that are 
fixed by all elements of S (when d is even, this actually imposes an additional 
constraint on the overall phase of elements of S). If the stabilizer on n qudits 
has n — k generators, then S will have d"“^ elements and the coding space will 
consist of k qudits. Note that this last fact need no longer be true when d is not 
prime, and this is the main source of complications in that case. It is unclear 
exactly how to deal with a code that does not encode an integral number of 
qudits. If we stick to codes for which all the generators of the stabilizer have 
order d, the rest of the proof will hold, modulo a question about gates necessary 
to generate the Clifford group. 

If an operator E and M G S satisfy 



then E\ij}) will have eigenvalue for M instead of eigenvalue -1-1, so we can 
detect that error E has occurred by measuring the eigenvalue of M . 

We can see the structure of the coding space by extending the generators of S 
to a complete independent set of commuting operators. When d is not prime, we 
also require these operators to have order d. Such a set will have cardinality n, so 
we can do this by choosing k additional operators Z\, . . . , Zk- These operators 
have the interpretation of the encoded Z operators for the k encoded qudits. We 
can then choose k more operators Xi, . . . ,Xk which satisfy the relations 



The operators Xi then act as the encoded X operators for the k encoded qu- 
dits. The generators of S along with Zi, ... ,Zk and Xi, . . . ,Xk then generate 
the group of all Pauli group operators that commute with S. As in the two- 
dimensional case, the operators that commute with S but are not themselves 
in S are precisely the operators that cannot be detected by the quantum code. 
They therefore perform encoded operations on the data. 

The Clifford group is the set of operators that leave V invariant under con- 
jugation. That is, it is the normalizer N{V) of V in the unitary group U{dE). 
The Clifford group is important for fault-tolerant computation because if we 
perform an operator U on the Hilbert space, the operator UNW has the same 
relationships to states after the transformation as the operator N did before the 

This is true for odd d. For even d, XZ has order 2d, so extra factors of i will be 
necessary, as in the d = 2 case. This aspect is actually simpler for odd d than for 



EM = lo‘^ME , 



( 6 ) 




( 7 ) 

(8) 

(9) 



d = 2. 



2 
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transformation. Therefore, instead of considering transformations of the states 
1^) ^ U\ip), we can consider transformations of the operators N UNW . 
When U is in the Clifford group and N is in the Pauli group, then UNU^ will 
also be in the Pauli group. Therefore, we can uniquely describe elements of fV(7^) 
by the permutation they induce on V. The permutation must preserve the group 
structure of V, but is otherwise arbitrary. 

In the two-dimensional group, N(V) was generated by two single-qubit op- 
erators R (the Hadamard transform |j) ^ |0) -I- (— 1)^ |1)) and P (the phase gate 
|j) ^ ^nd the two-qubit operator CNOT (|t)|j) ^ |*)|(* + j) mod 2)). In 

d dimensions, R generalizes to the d-dimensional discrete Fourier transform 

d 

, (10) 

P generalizes to the d-dimensional phase gate 

\j) ^ ^ ( 11 ) 

and CNOT generalizes to the SUM gate 

l*)|j) ^ !*)!(* + j) mod d) . (12) 

We can describe these operators by their induced transformations on the 
Pauli group. R maps 



P maps 



SUM maps 



X ^ Z , (13) 

X-i . (14) 

X ^ XZ , (15) 

Z ^ Z . (16) 

X®I ^ X®X , (17) 

I®X^I®X , (18) 

Z ® I ^ Z ® I , (19) 

I®Z^Z~^®Z . (20) 



However, it is not clear that these three gates generate the Clifford group. We 
may also need the S gate 

X ^ , (21) 

Z ^ Z^ , (22) 

for all pairs (a, &), where ah = I mod d. On kets, this gate acts as |j) ^ |aj). 
In fact, a single pair (a, b) is sufficient, as long as a generates the multiplicative 
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group I will not give a detailed proof that these gates generate the Clifford 
group, but using the P, R, and S gates, we can get the full one-qudit Clifford 
group. Then a construction similar to that used in [3 will give the full n-qudit 
Clifford group. The structure is somewhat more complicated when d is not prime, 
and I have not verified that these gates are sufficient for the nonprime case. 

Note that we can fault-tolerantly measure any operator in V that is the tensor 
product of Z operators by performing transversal SUM gates from the qudits 
to be measured to an appropriate ancilla. Since we are interested in eigenvalues 
with possible values oj^ for j — 0, . . . ,d — 1, the appropriate ancilla state is the 
superposition of all states where the sum of the qudits is 0 mod d. That way, no 
information beyond the eigenvalue of the measured operator will be conveyed. 
We can construct this state by Fourier transforming the state ' ' ' j)- 

Following DiVincenzo and Shor El, we can measure any operator in V by 
performing a transversal Clifford group operation C that takes the operator to 
the tensor product of Z’s, performing the measurement, and applying C~^. 

For any stabilizer code, the elements of the fc-qudit encoded Pauli group 
are also elements of the n-qudit unencoded Pauli group, as are the generators 
of the stabilizer. Since we can perform and measure an arbitrary element of 
the unencoded Pauli group, we have shown that for a stabilizer code over d- 
dimensions, we can apply encoded versions of X and Z for all encoded qudits, 
measure the generators of the stabilizer (and therefore perform fault-tolerant 
error correction), and measure all members of the encoded Pauli group fault- 
tolerantly. This provides step 0 of the proof. 



3 Measurements and Stabilizers 

In 0, it proved very helpful in a number of places to understand how the sta- 
bilizer of a state or subspace changed under measurements. The procedure for 
qubits generalizes easily to higher dimensions. 

First, recall that there is more than one way to choose generators for a given 
stabilizer. Any maximal set of independent operators in the group will suffice. In 
particular, any generator M can be replaced by NM for any N ^ M. Similarly, 
the encoded X and Z operators are only defined up to multiplication by elements 
of S. If we wish to measure an operator A G V, then the first step is to put the 
stabilizer and X and Z operators in a form so that all the X and Z operators 
commute with A and all but one of the generators of S commutes with A. We can 
do this because \i M G S does not commute with A, then M°-N will commute 
with A for some a for any N that commutes with M (as do all A’s, Z’s, and 
generators of S) . We will not need to consider the case where A commutes with 
everything in the stabilizer. 

This is a useful form for the stabilizer because any operator that commutes 
with A is not disturbed by the measurement of A. Therefore, we only need to 
change M when A is measured. Since A G V, the possible measurement results 
are for a = 0, . . . , d—1. These result corresponds to applying the projection 
operator 
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P,= (^I + + • • • + /d (23) 

to the state. Assume now that 

MA = ujAM (24) 

(note that when d is prime, this will always be true for some power of M). Then 

^ d-l ^ d-l 

MPaM^ = ^ d'^ A^' = Pa-i . (25) 

j=0 j=0 

Thus, if we measure A and get the result a, by applying M“ we can produce the 
same state we would have gotten if we had gotten the result 0. 1 will assume below 
that any measurement is followed by such a correction. Once this correction is 
performed, the new state is a +1 eigenvector of A, so A should be added to the 
stabilizer. It is not an eigenvector of M, so M is removed from the stabilizer. 
All of the other generators (which have been put in a form where they commute 
with A) are unchanged. 

4 Gates Derived from SUM 

Suppose we have the ability to perform the SUM gate between any pair of qudits 
in our computer, as well as the ability to perform the Pauli group and to measure 
operators in the Pauli group. I will now show that we can apply the full Clifford 
group to the computer. 

Suppose we consider a single unknown qudit and prepare a second ancilla 
qudit in the state |0). This two-qudit system can be described by the stabilizer 
I ® Z. The logical Pauli group is generated hy X = X ® I and Z = Z Z> I- Now 
perform a SUM gate from the first qudit to the second qudit. The stabilizer is 
Z~^ Z) Z, X = X X, and Z = Z Z I. 

Suppose we were now to measure the operator A = I Z XZ. Then M = 
Z~^ Z Z G S and MA = loAM. Therefore, this measurement results in the 
stabilizer I Z XZ, and X = XZ~^ Z XZ and Z = Z Z I- We can discard the 
second qudit, and the effective transformation on the first qudit is 

X ^ XZ~^ , 

Z ^ Z . 

This is the gate P~^ . d — \ iterations of it will produce the P gate. 

Alternatively, we could have prepared the ancilla qudit so that the stabilizer 
of the system began as I Z X, then performed the SUM gate from the second 
qudit to the first. The stabilizer would then be X 0 X, and X = X Z I and 
Z = Z Z Z~^. Then we measure A = I Z XZ~^ and choose M = X Z X so that 
MA = ujAM. The final stabilizer is I Z XZ~^, so we discard the second qudit, 
leaving 



(26) 

(27) 



Z^XZ . 



(28) 

(29) 
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Call this gate Q. Then R~^ = XQP~^Q, and R = R~^. 

Now suppose we again prepare the ancilla in the -1-1 eigenstate of X, but 
now perform s SUM gates from the second qudit to the first instead of one. The 
stabilizer is X^ ^ X, X = X 0 I, and Z = Z ® This time we measure 
A = Z 0 I. This results in stabilizer Z ® I, 

X = {X ® I) {X^ ® X)~"~" = I ® X~^~\ (30) 

and Z = ZZ)Z~’^. Therefore, discarding the first qudit leaves the transformation 

, (31) 

Z ^ Z-’^ . (32) 

By choosing an appropriate s, we can therefore perform an arbitrary S gate. 
Note that in this case, the data ends up in what was originally the ancilla qudit. 

I have shown how to produce the P, R, and S gates from the SUM gate. 
Therefore, given the SUM gate, we can produce the full Clifford group. This 
completes step 0 of the proof. 

5 Producing the SUM Gate for Any Stabilizer Code 

To see how to construct the SUM gate between any pair of encoded qudits, first 
consider two unencoded qudits. Introduce a third qudit in the state |0). The 
stabilizer at this point is I®I®Z. Assume we can do Pauli group measurements, 
even entangled ones, and perform operators in the Pauli group. Let us first 
measure the operator / 0 A 0 X~^. This becomes the stabilizer. The logical 
Pauli group generators are 





~Xi = X ® I ® I , 


(33) 




~X2 = I ® X ® I , 


(34) 




'Zx = Z ® I ® I , 


(35) 




~Z 2 = I ® z ® z . 


(36) 


Now 


measure Z ® I ® Z.lt becomes the new stabilizer, and 






Ai = A 0 A 0 A"^ , 


(37) 




Aa = Jo A(g) 7 , 


(38) 




~Zi = Z®I®I , 


(39) 




'Z 2 = I ® z ® Z . 


(40) 


Finally, 


measure I ® I ® X and discard the last qudit. This leaves 


us with 




Ai = A O A , 


(41) 




Aa = JOA , 


(42) 




= ZO J , 


(43) 




Z 2 = z ^ 0 z . 


(44) 
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This we recognize as the transformation induced by the SUM gate, so this series 
of entangled measurements has performed the SUM gate between these two 
qudits. 

Now, to apply this to a quantum code, we just need to be able to measure 
entangled logical Pauli group operators between any pair of encoded qudits. If 
the qudits are in the same block, this is straightforward. For instance, if they 
are in slots i and j, the encoded version of X 0 X is just XiXj. This is in the 
Pauli group too, so we know how to measure it. 

If the qudits are in different blocks, it is not much harder. Instead of using 
an a-qudit ancilla state, we use an (a + 6)-qudit ancilla state (where a and b are 
the weights of the operators Xi and Xj), which is again in the superposition of 
all states whose registers sum to 0 mod d. The operator we wish to measure is 
Xi (E) Xj, which is in the Pauli group. By performing the appropriate transversal 
Clifford group operation, we rotate this to be the tensor product of Z’s and 
perform SUM gates from the appropriate qudits to the corresponding ancilla 
qudits, then perform the inverse Clifford group operator to restore the state to 
its original form. Then we measure the a + b ancilla qudits, and this tells us the 
eigenvalue of the measured operator. We use an (a + 6)-qudit ancilla instead of 
an a-qudit ancilla plus a 6-qudit ancilla because we do not wish to be able to 
find the eigenvalues of Xi ^ I and I (g> Xj separately, only their product. 

Therefore, given an encoded ancilla qudit which is initialized to |0), by per- 
forming the encoded version of the above entangled measurements, we can per- 
form a SUM gate between any pair of encoded qudits anywhere in the computer. 
Note that the ancilla qudit can itself be anywhere in the computer; it need not 
be in the same block as either data qudit, or in the corresponding place in a 
different block. 

Given the SUM gate and the results of the previous section, we can perform 
the full Clifford group on the encoded data for any stabilizer code. This completes 
the proof up to step El This part of the proof is a significant improvement on the 
method used in |Z]. In that paper, it was necessary to introduce full ancilla blocks 
to perform a CNOT. Here, we need only a single logical ancilla qudit. In the 
case where a block may encode many qudits, this can be a major improvement. 
The price is that we must potentially perform entangled measurements on more 
than one block. This means we will have to use larger ancilla states for the 
measurement; this results in a greater potential for error, so we will have to 
repeat the measurement more times, and perhaps perform error correction a bit 
more often. However, in many situations, the total number of physical gates we 
need will decrease. 

Note that this procedure works just as well if the two logical qudits in- 
volved in the SUM are in blocks made up of different numbers of physical qudits. 
This means we can interact qudits encoded with different sorts of codes fault- 
tolerantly, or change the encoding of a single qudit without losing the protection 
against errors at any time. 
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6 Completing the Universal Set of Gates 

The set of universal gates can be completed by adding the higher-dimensional 
analog of the Toffoli gate [El 



j)\b)\c 



\a)\b)\c + ah) 



(45) 



It turns out that a generalization of Shor’s fault-tolerant construction of the 
Toffoli gate jS| will work here. 

Suppose we prepare a three-qudit ancilla in the -1-1 eigenstate of the three 
operators 



Mi = {X®I®I) SUM(2 ^ 3) , 
M 2 = (J®X®7)SUM(1 ^ 3) , 
M3 = {I®I® Z) PHASE(1, 2)-^ 



(46) 

(47) 

(48) 



SUM(i ^ j) is a SUM gate performed with the Hh qudit as control and the jth 
qudit as target. PHASE(i,j) is the PHASE gate 



PHASE|a)|6) = w“>)|6) 



(49) 



performed on the Ah and jth qudits. One important fact to note is that both of 
these gates are in the Clifford group. Since we have already constructed the Clif- 
ford group, this will enable us to also construct the Toffoli gate. The appropriate 
state is 

1 ^) = ■ ( 50 ) 

a,& 

Now, given three data qudits, perform inverse SUM gates (i.e., \a)\b) 
\a)\b — a)) from the first and second ancilla qudits to the first and second data 
qudits, respectively, and a SUM gate from the third data qudit to the third ancilla 
qudit. Now we measure the last three qudits, the original data qudits, in the bases 
Z, Z, and X, respectively. After performing the appropriate corrections (which 
will likely involve gates from the Clifford group as well as the Pauli group), we 
are left with the data in the first three qudits, which were originally the ancilla 
qudits. It turns out that after these operations, a Toffoli gate has been performed 
on the data qudits. 

To construct the appropriate ancilla state | A), we can again follow Shor. The 
various states 

“ (51) 



\^j) = 55 I“)|5)|a5 + j) 



for j = 0, . . . , d — 1 are related to | A) by 

\Aj) = {I I X^) \A) . (52) 

The states |Ajj, like |A), are -1-1 eigenstates of Mi and M 2 , but M^lAj) = oj^\Aj). 
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Furthermore, note that 



d-l 

^|A,) = ^|a)|&)|c) . (53) 

j—0 a,b,c 

This last state is easily constructed as the Fourier transform of |000). Then by 
measuring M 3 for this state, we can collapse the state into one of the states \ Aj). 
By applying the operator X~^ , we get the state |A). 

To measure M 3 fault-tolerantly for a quantum code, we prepare the CAT 
state \ jj ■ ■ ■ j) (using the same number of qudits as in a block of the code). 

M 3 is in the Clifford group, so we can perform it by some sequence of tranversal 
operations and measurements. By conditioning the appropriate operations for 
the Ah qudit on the Ah qudit of the CAT state, we can conditionally perform 
M 3 on the code depending on the CAT state. Note that a conditional operation 
in the d-dimensional case means applying M| when the control qudit is in the 
state \j). Once we have done this, when the code is in an eigenstate of M 3 with 
eigenvalue w'*, the CAT state ends up in the state 

= ■ (54) 

j 

The various states \CATg) are orthogonal to each other, and so can be distin- 
guished by an appropriate measurement. This therefore gives us a measurement 
of M3, completing the construction of the Toffoli gate and step lil of the proof. 

Because everything we do is transversal, single qudit errors in the CAT state 
cannot become more than single qudit errors in any single block of the code. 
Naturally, after creating the CAT state we should verify it to make sure there 
are no correlated errors. In addition, a single qudit error in the CAT state could 
give us the wrong measurement result. Therefore, the measurement of M 3 should 
be repeated in order to sufficiently increase our confidence in the result. 
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Abstract. I report two general methods to construct quantum convo- 
lutional codes for quantum registers with internal N states. Using one 
of these methods, I construct a quantum convolutional code of rate 1/4 
which is able to correct one general quantum error for every eight consec- 
utive quantum registers. Keywords: Code Pasting, Convolutional Codes, 

Phase Shift Error, Spin Flip Error, Quantum Codes, Quantum Error 
Correction. 



1 Introduction 



Quantum error correction code (QECC) is a succinct way to protect a quantum 
state from decoherence. The basic idea behind all QECC schemes is that by 
suitably encoding a quantum state in a larger Hilbert space i?, and then later 
on measuring the wave function into certain subspace C of H, it is possible to 
detect the kind of errors that have occurred. Finally, one can correct the error by 
applying a suitable unitary transformation to the orthogonal complement of C 
according to the measurement result m- Many QECCs have been discovered in 
the last few years (see, for example. Refs. |4I6I7I8I111 Oil 511 811 
and various theories on the QECC have also been developed (see, for example. 
Refs. PI4I7I9I1()I15IH)I17IH)I21I22I25[ 1. In particular, the necessary and sufficient 
condition for a QECC is |dlltill7| 



(^encode I encode ) = ^A,B 



( 1 ) 



where |iencode) denotes the encoded quantum state |i) using the QECC; A,B 
are the possible errors the QECC can handle; and is a complex constant 
independent of |tencode) and |/encode)- 

All QECCs discovered so far are block codes. That is, the original state ket 
is first divided into finite blocks of the same length. Each block is then encoded 
separately using a code which is independent of the state of the other blocks 
{cf Refs. IBEI71 1. Besides block codes, convolutional codes are well known in 
classical error correction. Unlike a block code, the encoding operation depends on 
current as well as a number of past information bits 1131201 . For instance, given a 
(possibly infinite) sequence of classical binary numbers (oi, 02, ... , am, ■ ■ ■), the 
encoding (61, ci, &2, C2, . • . , bm, Cm, ■ ■ •) with 



bi = Qi + Ui-2 mod 2, Ci = ai + Ui-i -I- Oi_2 mod 2 



( 2 ) 



C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 314- IT??! 1999. 
© Springer- Verlag Berlin Heidelberg 1999 
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for all i, and oq = a_i = 0 is an example of classical convolutional code that can 
correct up to one error for every four consecutive bits (see, for example, chap. 4 
in Ref. ^ and Lemma 0 in Section 0 for details). 

In classical error correction, good convolutional codes often outperforms their 
corresponding block codes in the sense that they have higher encoding efficiencies 
CHOI- Thus, it is instructive to find quantum convolutional codes (QCC) and 
to analyze their performance. Here, I report two ways to construct QCCs. And 
from one of these methods, I construct a QCC of rate 1/4 that can correct one 
quantum error for every eight consecutive quantum registers (see Ref. 1111 for 
more details). 

2 Constructing Quantum Convolutional Codes from 
Quantum Block Codes 

In this Section, I report a general scheme to construct QCCs from quantum 
block codes (QBCs). But before doing so, let me first introduce some basic 
notations. Suppose each quantum register has N orthogonal eigenstates, where 
N is an integer greater than one. Then, the basis of a general quantum state 
making up of a collection of possibly infinite quantum registers can be chosen 
as {|fe)} = {|fci, ^ 2 , . . . , km, ■ ■ •)}) where km & for all m G ^ with N > 2. 
Moreover, I abuse the notation by defining km = 0 for all m < 0. Finally, all 
additions and multiplications in all state kets below are modulo N. 

Definition 1. Let \x) = ^ 2 , • ■ • , • ■ •) = J2{k}^k\k) be 

a quantum state. Any quantum error can be regarded as an error operator £ 
acting on this state. In particular, there is a spin flip error occurring at 
quantum register m (with respected to the basis {|fc)}/ if and only if £\x) = 
J2{k}°‘k\ki,k2, . ■ . ,km-i,km,km+i, ■ ■ ■), whcrc km{km,£) is a ^N-function of 
km and £. Moreouer, a spin flip error is said to be additive provided that 
km{km,£) = km + OL mod N for some a{£). 

Similarly, there is a phase shift error occurring at quantum register m 
(with respected to the basis {|fe)}/ if and only if £\x) = f{km,£)\k) for 

some complex-valued function f{km,£) with |/p = 1. Spin flip and phase shift 
errors occurring at more than one quantum register are defined in a similar way. 



With the above notations and definition in mind, a QBC and a QCC can be 
defined as follows: 

Definition 2. The linear map sending 



\k) = \ki,k2, . . . ,kn) 

' *■ *2) • ■ • ) *m) = Oi ^ I*) = |feencode) , (3) 

{i} 

(k) 

where a) G C, and ki G Zpf for all i = 1,2, . . . ,iV is said to be a quantum 
block code (QBC) that can correct errors in the set E if and only if Eq. Q) 
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is satisfied for all A,B G E. Since Eq. (0) encodes ever n quantum registers to 
m registers, the rate of this code is, therefore, defined as njm. In addition, one 
can encode the quantum state using the above QBC as l^incode)- 

On the other hand, if the encoding scheme expressed in Eq. 0) depends on 
current as well as past quantum states ( that is, the coefficients in Eq. 0 de- 
pend on more than one ), then it is called a quantum convolutional code 
(QCC). The rate of this convolutional code equals n/m because it asymptotically 
encodes every n quantum registers as m registers. 

With the above definitions in mind, one can construct a family of QCCs from 
a QBC as follows: 

Theorem 1. Given a QBC in Eq. and a quantum state |fc) = \ki) 

making up of possibly infinitely many quantum registers, then the mapping 



+ 00 

|fe) = (g)|fc. 

2 = 1 



+ CXD 



*^encode/ — ' 



E (^ fiipkp) ^ 



Ui} 



( 4 ) 



forms a QCC of rate n/m provided that the matrix pip is invertible. This QCC 
handles errors in the set E ® E ® E ® ■ ■ ■ . 



Proof. Let me consider the effects of errors £ = £i (g> £2 ^3 <8i • • ■ and £' = 

£'i ® E '2 ® ® • in E ® E ® E ® ■ on the encoded quantum registers by 

computing (fcgncodel^'^^l^encode)- From Eq. (P), I find that 



+ 00 



(^encodel*^ '^^^l^encode) — 



2=1 
+ 00 



S' 



C^pt^ip^v) , ., 









n [( fc encode I ^ 



2=1 
+ 00 p 

n 

2=1 



' encode 



^J2p Vipkp,J2p f^ipK 



( 5 ) 



for some constants independent of k and k' . Because p is invertible, it 

is clear that ki = fc' for all i € 2Z^ is the unique solution for the systems of 
linear equations J^pTip^p = Y.pTipk'p- Consequently, = 

^k.k' As^£i, for some constant independent of k and k' . Thus, the mapping 
in Eq. 0 is a QCC. □ 



Now, let me uses Theorem P to give an example of QCC. 

Example 1. Starting from the five qubit perfect code for N = 2 jbllUliSj . Theo- 
rem P implies that the following QCC can correct up to one error in every five 
consecutive qubits: 
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+ 00 



i=l 



1 



AT-l 



iV3/2 



E (- 1 ) 



{ki+ki-i){pi+qi+ri) +pi ri 



Pi,qi,ri=0 

\pi,qi,Pi + ri,Qi + ri,pi + qi + ki + fci_i) 



( 6 ) 



where km G {0, 1} for all m G . The rate of this code is 1/5. 

Although the QCC in Eq. (EJ looks rather complicated, the actual encoding 
process can be performed readily. Since p is invertible, one can reversibly map 
|fei, fc 2 j • ■ ■ , ^Ti, ■ • ■) to I ^2p Pipf^p: X/p ■ ■ ■ I X/p Pnpkp, ■ . .) |lKiii^| . Then, 

one obtains the above five bit QCC by encoding each quantum register using 
various encoding procedures described in Refs. 

3 Constructing Quantum Convolutional Codes from 
Classical Convolutional Codes 

In this Section, I report a general method to construct QCCs from classical 
convolutional codes. My construction is based on the following two technical 
lemmas which hold for both QBCs and QCCs: 

Lemma 1. Suppose the QECC 

(7) 

{j} 

corrects (independent) additive spin flip errors in certain quantum registers. 
Then, the following QECC, which is obtained by discrete Fourier transforming 
every quantum register in Eq. (0), 



1 ^) ^ E n 

U,p] i=i 

corrects (independent) phase errors occurring in the same set of quantum regis- 
ters. The converse is also true. 

Proof. Consider two arbitrary but fixed additive spin flip errors £ = 
and £' = acting on the code in Eq. ( 0.1 denote the set of all quantum 

registers affected by either one of the above spin flip errors and unaffected by 
both errors as A and U , respectively. Then Eqs. ([[J) and 0 imply that 



E 

{3,3'} 




\ieA 



dk,k' As^s' 



(9) 



for some constant Ag^si independent of k and k' . 
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For additive spin errors, = ifi + = ^j',+a'.,ji+ai for 

some constants ai,a^ € ^n- In other words, {j[\£'^£\ji) is a binary function 
of ji - j'i only. Thus, Eq. m still holds if I replace {ji\£'i^ £i\ji) by a binary 
function g{ji — j' ■. i G A). Moreover, the linearity of Eq. Q implies that the 
same equation holds if I replace {fi\£'i'' £i\ji) by any complex- valued function g 
taking arguments on ji — j[ for all i G A. That is to say. 



E 






gUi - j'i -iG A) 



A, ( 10 ) 



for some complex- valued Ag independent of k and k' . Conversely, it is obvious 
that if satisfies Eq. (1 1 1 )ll , then Eq. (0) is a QECC that is capable of correcting 
additive spin flip errors. In other words, Eq. m is a necessary and sufficient 
condition for the QECC to correct additive spin flip errors. 

Now, I consider the actions of two phase shift errors T and T' acting on the 
same set of quantum registers as those in £ and £' , respectively. Then 






encode/ 



= E 



-(kj 

a], 



(fc) 






{jM'.p.p'I 




X 




= E 



_(fe') (k) 

a., a) 




E 






p'r.i^A _ 



X 




( 11 ) 



For phase shift errors, {p'i\J''i ^i\Pi) = jpty. h for some complex-valued function 
h oi Pi i G A with \h\’^ = 1. Consequently, Eq. (II III can be further simplified as 



(k( 



encode 



encode) 



E 


-(fc') {k) 

3 3 


n 




{j'd'.p} 




\ieu / 





( 12 ) 



for some complex- valued function h{pi : i G A). Summing over all the piS, in 
Eq. (II 2ll . I obtain 



(kl 



encode 



|J^'Ij^|kencode) 



= E 

UJ'} 



-{k') (fe) 

a), ^ a\ ^ 

3 3 



Kiev 



h'iji - -.iG A) 



(13) 
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for some complex- valued function h'{ji — j[ : i C A). Comparing Eqs. lilull 
and (23, one concludes that (fcenj.odg|lCdj^|A;e„code) = Sk,k' Ay^^y^> for some 
independent of both k and k' . Thus, the QECC given in Eq. (|3 corrects the 
phase shift errors as promised. 

Conversely, from Eq. m, one concludes that Eq. (0) corrects phase errors 
if and only if 



E 

{j'd'} 






- -iG A) 



^k.k' Afi' 



(14) 



for any complex- valued function h' {ji — j' : j C A). Hence, from Eq. (HOI) , one 
concludes that Eq. o is able to correct additive spin flips errors. □ 

In essence. Lemma Qtells us that the abilities to correct additive spin flip and 
phase shift form a dual pair under the discrete Fourier transform of quantum 
registers. An interesting case occurs when N = 2. Here, additive spin flip is the 
only possible kind of spin flip error. As a result, the abilities to correct spin 
flip and phase shift errors in TV = 2 form a dual pair under Lemma G1 And this 
special form of LemmaHwas proven earlier by various authors (see, for example. 
Refs. lECCSl). 

Corollary 1. If a QECC handles both spin flip and phase shift errors on the 
same set of quantum registers, then this QECC handles any general quantum 
errors occurring at the same set of quantum registers. 

Proof. Combining Eqs. (il i)t and m, one knows that Eq. m holds for any 
complex-valued function g{ji,j- : i G A). By putting (j'|£f£i|ji) = g{ji,j'f) for 
all i G A, then one concludes that the above QECC is capable of correcting any 
general quantum errors as promised. □ 



Lemma 2. Suppose QECCs Cl and C2 handle phase shift and spin flip errors, 
respectiuely, for the same set of quantum registers. Then, pasting the two codes 
together by first encodes the quantum state using Cl then further encodes the 
resultant quantum state using C2, one obtains a QECC C which corrects general 
errors in the same set of quantum registers. 



Proof. Clearly C can handle spin flip errors occurring at the specified quantum 
registers. So from Corollary ^ it remains to show that C corrects phase errors 
as well. Let the encodings for Cl and C2 be \k) i — > J2{j} \j) i — > 

^{p} ^P respectively. Then using the same set of notations as in the proof 
of Lemma ^ one knows that 



iK 



encode 






= E 

{jd'.p.p'l 




l[{p',\E';j^.\p^) 



\ieA 
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W_9t{Vi A) 



.ieA 



= E ] “j' ^ ^ ^P ^P ^ n 9(p^ - ie A) I (15) 

{j’j'.p} ^ AeA . } 

for some complex- valued functions gi{pi \ i & A) for all i ^ A. 

Since C2 handles spin flips, one demands that whenever j ^ j' , 

E ^P ^p '’ = ® = E ^p' ^ ^p '’ ’ (16) 

{p} {p,p'l 



where £ denotes a possible spin flip error that can be handled by the QECC C2. 
Consequently, 

/ 

^ = 0 , (17) 

{p} 

where the above primed sum is over either (1) all the p that is affected by the 
error £, or (2) all the p that is unaffected by the error £. 

From Eq. (C3), it is easy to see that after summing over all piS in Eq. m, 
one will arrive at 



(fc:=„codel-^'l-^|feencode) = J] 



lid'} 



_(fe') (k) 

o-A, a\ ’ 

3 3 



^3,3' ( ^i(ji • * C A) 

\ieA J 



(18) 



for some complex- valued function hi{ji : i G A). As Cl handles phase shift, 
one concludes that J^|fcencode) = ^k.k'Ayr jr,. Hence, the Lemma is 

proved. □ 



At this point, I would like to remark that the proof of the abilities to correct 
both spin flip and phase shift implies the ability to correct a general error for 
N = 2 can be found in Refs. |(il7ll4l2(H . Moreover, one should notice that the 
ordering of encoding in Lemma 13 is important. Encoding first using a spin flip 
code followed by a phase shift code does not, in general, result in a general 
QECC. After proving the above two technical lemmas, I report a method to 
construct QECCs from classical codes. 



Theorem 2. Suppose C is a classical (block or convolutional) code of rate r 
that corrects p (classical) errors for every q consecutive registers. Then, C can 
be extended to a QECC of rate that corrects at least p quantum errors for 
every q^ consecutive quantum registers. 



Proof. Suppose C is a classical (block or convolutional) code. By mapping m to 
\m) for all m G C can be converted to a quantum code for spin flip errors. 
Let C be the QECC obtained by Fourier transforming each quantum register of 
C. Then Lemmanimplies that C is a code for phase shift errors. From LemmaEl 
pasting codes C and C together will create a QECC of rate r^. Finally, the fact 
that C corrects at least p quantum errors for every q^ consecutive quantum 
registers follows directly from Corollary Q □ 




Quantum Convolutional Error Correction Codes 321 



Theorem 0 provides a powerful way to create high rate QECCs from high 
rate classical codes. 

Example 2 ((Shor)). Starting with the simplest classical majority block code of 
rate 1/3, namely, |fc) i — > |fc, fc,fc) for k = 0,1, Theorem El returns the famous 
Shor’s single error correcting nine bit code 1231 of rate 1/9: 

1 

|/c) I — > \p,p,p,q,q,q,r,r,r) . (19) 

p,q,r—0 

Alternatively, one may start with a high rate classical convolutional code. One 
of the simplest codes of this kind is the 1 /2-rate code in Eq. (El . Being a non- 
systematicQ and non-catastrophic0 code (see, for example, chap. 4 in Ref. ng 
for details), it serves as an ideal starting point to construct good QCCs. First, 
let me write down this code in quantum mechanical form: 

Lemma 3. The rate 1/2 QCC 

+00 +00 

(^) \ ki) ' ^ l^encode) = (^) \ ki + ki-2, ki + ki-i + ki- 2 ) , (20) 

i=l i=l 

where ki S for all i G , can correct up to one spin flip error for every 
four consecutive quantum registers. 

Proof. Here, I give a “quantum version” of the proof. Using notations in the 
proof of Theorem m I consider ''^£|fcencode) again. Clearly, the worst 

case happens when errors £ and £' occur at different quantum registers. And in 
this case, Eq. (1201) implies that exactly two of the following four equations hold: 

{ k2i + k2i-2 = k'2i + k'2i-2 

k2i + k2i-l + k2i-2 = k'2i + k'2i_i + k'2i_2 
^21-1-1 + ^21-1 = ^21+1 + ^2i-l 
^2i+l + k2i + k2i-l = ^2^+1 + k'2i + k'2i_i 

for all i G 2Z^ . One may regard fciS as unknowns and fc(s as arbitrary but fixed 
constants. Then, by straight forward computation, one can show that picking 
any two equations out of Eq. 112 1 II for each i will form an invertible system with 
the unique solution ki = k[ for all i G 2Z^ . Therefore, (fc/j^j.Q(jg|5''f£|fcencode) = 
5k.k' Ss,e' and hence this lemma is proved. □ 



Example 3. Theorem El and Lemma El imply that 



+ 00 



2 = 1 



+ 00 




2 = 1 



E l (fci+fei_2)pi + (/Ci+/Ci_i+fei_2)9i 

Lpi,qi,... 



\Pi+Pi-l, 



Pi + Pi-1 + qi-i,qi -I- qi-i,qi -|- ft-i + Pi) 



( 22 ) 



^ That is, both bt and a are not equal to ai. 

^ That is, a hnite number of channel errors does not create an infinite number of 
decoding errors. 
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where kt G Zn for all i G , is a rate 1/4 QCC capable of correcting up to 
one quantum error for every sixteen consecutive quantum registers. 

In what follows, I show that this code can in fact correct up to one quantum 
error per every eight consecutive quantum registers. 



Proof. Suppose £ and £' be two quantum errors affecting at most one quantum 
register per every eight consecutive ones. By considering |fcencode), 

I know that at least six of the following eight equations hold: 






P2i-\ + P2t-2 = P2i-1 + P2Z-2 
P2i-1 + P2t-2 + Q2t-2 = P2i-1 + P21-2 + '?2z-2 
q 2 i-l + Q 2 i -2 = q' 2 i-l + 92i-2 
q2i-l + q2i-2 + P2i-l = q'2i-l + q'2i-2 + P2i-1 
P2t + P2i-l = P2i + P2t-1 
P2z + P2i-1 + 92*- 1 = P2i + P2t-1 + 92*- 1 
92*+ 92*-l = 92* + 92*-1 
92* + 92*- 1 + P2i = 92i + 92i-l + P2i 



(23) 



for all i G . Again, I regard pi and qi as unknowns; and and 9 ' as arbitrary 
but fixed constants. Then, it is straight forward to show that choosing any six 
equations in Eq. (12 311 for each i G would result in a consistent system having 
a unique solution of pi = p[ and qi = 9 ' for all i G . Consequently, 



(fcencodel^'^^lfe encode) 
+00 r 



E n 

{p.q} K 2=1 



UJ 



N 



x(/*|f*'^|/*)(9*|f|9*) 



(24) 



for some linearly independent functions fi{p,q) and gi(p,q). 

Now, I consider a basis {hi{p, q)} for the orthogonal complement of the span 
of {/G 9 *}jg^+- summing over all hiS while keeping fiS and piS constant in 
Eq. m, one ends up with the constraints that kt = fc' for all i G . Thus, 



= <5. 



(feencodelf'^^lfc encode) 

+CXD 

n 9 )l^'V*(P, 9)) { 9 ^{p, 9 )l^l 9 *(p, 9))) 



k.k' E 

{p,g} 



(25) 



Hence, Eq. H 22 I 1 corrects up to one quantum error per every eight consecutive 
quantum registers. □ 



From the discussion following Example D the encoding in Eq. (r21)l can be 
done efficiently with the help of reversible computation ( 1 121 1 2 j . 
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4 Outlook 



It is instructive to investigate the coding ability of QCCs as compared to that 
of QBCs. Knill and Laflamme HH proved that it is impossible to construct a 
four qubit QBC that corrects one general quantum error. Their result can be 
extended to the case when N > 2 EDI. Here, with a slight modification of Knill 
and Laflamme’s proof, I show that: 

Theorem 3. It is not possible to construct a QCC which corrects one general 
quantum error for every four consecutive quantum registers. 

Proof. Clearly, the QCC must be of rate 1/4. And with a simple permutation of 
the quantum registers, a general QCC of rate 1/4 can be written as 

|fc) ' *■ |feencode) = ^ ^ 2/j '^) ■ (^6) 

{w .X ,y .z} 



Without lost of generality, I may assume that quantum errors occurs in any one 
of the following four set of registers: |te), \x), \y) and \z). 

Then, following Knill and Laflamme [HI by considering the action of errors 
in the above four sets of registers, one arrives at ^ = 0 and ^ 

for all k ^ k' . where 



(fc) 

Pw' ,x' \w ,x 



= E 



' n ^ 

w’ .x' .y.z w,x.y,z 



fk) 



(27) 



Hence, the reduced (Hermitian) density matrices p^^^ are nilpotent for all k. 

(k) 

This is possible only if a-w'x.y.z = 0 for all k,w,x^y, z. This contradicts the 
assumption that |fcencode) is a QCC. □ 

It is, however, unclear if QCC can perform better than QBC in other situa- 
tions. And further investigation along this line is required. 
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Abstract. Most of the quantum error-correcting codes studied so far 
fall under the category of additive (or stabilizer) quantum codes, which 
are closely related to classical linear codes. The existence and general 
constructions of efficient quantum codes that do not have such an un- 
derlying structure have remained elusive. Recently, specific examples of 
nonadditive quantum codes with minimum distance 2 have been pre- 
sented. We, however, show that there exist infinitely many non-trivial 
nonadditive codes with different minimum distances, and high rates. In 
fact, we show that nonadditive codes that correct t errors can reach the 
asymptotic rate R = 1 — 2 H 2 ( 2 t/n), where H 2 {x) is the binary entropy 
function. In the process, we also develop a general set of sufficient con- 
ditions for a quantum code to be nonadditive. Finally, we introduce the 
notion of strongly nonadditive codes, and provide a construction for an 
((11,2,3)) strongly nonadditive code. 

key words: quantum code, additive code, Gilbert-Varshamov bound. 



1 Introduction 

Almost all quantum error-correcting codes known so far are additive (or stabi- 
lizer) codes. An additive code can be described as follows. Consider the group 
Q of unitary operators on the Hilbert space defined by the tensor products 

±Mi (g) M2 ® ® Mn, where each Mi is either the identity I = 

the Pauli matrices (Jz, or ay = ioxaz- Then an additive code is a subspace 
Q of for which there is an Abelian subgroup H of Q such that every vector 
of Q is a fixed point of every operator in H miTi n This approach leads to a 
close connection between self-orthogonal (under a specific inner product) linear 
binary codes and additive codes, such that the minimum distance of the additive 
code is determined from the binary code. 

^ This is actually the definition of a real additive code; i.e., a code which has a basis 
consisting of vectors from . In this paper we restrict ourselves to the set of real 
codes, but this does not restrict our results, since every additive code is equivalent 
to a real one m 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 325-^3 1999- 
© Springer- Verlag Berlin Heidelberg 1999 
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It is natural to ask whether there is any quantum error-correcting code that 
cannot be constructed in this way, directly or via some equivalence. We should 
make here a comment on the correct formulation of this question. Since the 
dimension of every additive quantum code is a power of 2, any quantum code 
whose dimension is not a power of 2 is not additive or equivalent to an additive 
code; especially, any subspace of an additive code with dimension not a power of 2 
is a nonadditive code. We call such codes trivial nonadditive codes. But we prove a 
general theorem that shows that infinite families of non-trivial nonadditive codes 
with different values of d exist. The nonadditiveness of these codes does not follow 
from their dimensions (the dimensions of these codes are also powers of two), 
but from their very special structure. Moreover, we show that these nonadditive 
codes asymptotically reach the same rate as Calderbank-Shor-Steane codes. 

We also propose the notion of strongly nonadditive codes: a quantum code 
Q is strongly nonadditive if the trivial code is the only additive code that 
contains any code equivalent to Q. Now the interesting problem is to find strongly 
nonadditive quantum codes. Recently in H3| it is shown that a ((5, 6, 2)) strongly 
nonadditive code exists, which is better than any ((5, RT, 2)) additive code. Later 
in C21, Rains showed that there exists a ((2m, 4™ ^,2)) nonadditive code, for 
every m > 3. We present an ((11,2,3)) strongly nonadditive code. 

In Section 0 we first determine a criterion that guarantees additiveness and 
strongly nonadditiveness of quantum codes, and then we present our examples 
of additive and strongly nonadditive codes. 

2 Preliminaries 

Consider the Hilbert space with its standard basis |?;i) , . . . , |r' 2 "), where 
111 , . . . , ti 2 " is a list of binary vectors of length n in {0, 1}”. For every binary 
vector a of length n, we define the unitary operators Xa and by the following 
equations 

-’fa ki) = \Vi + a ) , 

\v,) = \v,) . 

Note that X^Zp = (— 1)“'^Z/3X„. 

Let Q be the group of all unitary operators of the form iMi®- • where 

Mi G { I ,ax,ay,Gz }. Then every member of Q can be represented uniquely as 
(— 1 )’'XqZ/ 3, where A G {0,1} and a,/3 G {0,1}". For every subgroup S of Q, 
let S C {0, 1}^" be the set of all vectors (o;|/3) such that either X^Zf} G 5 or 
—XaZ /3 G S. We say S is totally singular for every (a|/3) G 5 we have a-/3 = 0. 
We also define a special inner product on {0,1}^" as 

iy{a\b),{a'\h')) = a-h' + a' -h, (1) 

where the right-hand side is evaluated in GF(2). For any quantum code Q in 
, we define the stabilizer Tig of Q as 

Hq = { ip & Q ■. g}\x) = \x) for every \x) in Q} . 
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Then it is easy to check that Tig is an Abelian group and every element of 
TLq squares to the identity operator. So Tfg is totally singular. It also follows 
that TLq is isomorphic to a vector space GF(2)™, for some m. This means that 
TLq IS generated by operators ipi,... & TLq and every ip G TLq can be 

written (uniquely, up to the order of the pi’s) as p = pi^^ ■■■Pm‘^"", where 
Ci G {0, 1}. In this case the quantum code Q has dimension 2"“”^. Suppose that 
Pi = So TLq can be determined by its m x (2n) binary generating 

matrix 



/ ai 


Pi\ 


\ 


Pm) 



Note that if such a matrix M obtained from a stabilizer, then Oi ■ (3i = Q and 
oti ■ Pj + aj ■ Pi = 0, for every i and j. A quantum code Q is called additive (or 
stabilizer) if it is defined by its stabilizer TLq, i.e., 

Q = I |x) G ■. p\x) = lx) for every p G TLq | . 

The quantum codes Qi and Q 2 in are locally equivalent if there is a 
transversal operator lA = u\ ® ® Un, with Ui G SU(2), mapping Qi into Q 2 - 

We say these codes are globally equivalent, or simply equivalent, if Qi is locally 
equivalent to a code obtained from Q 2 by a permutation on qubits. 

A quantum code Q C is called nonadditive if it is not equivalent to any 
additive code; moreover, Q is strongly nonadditive if the only additive code 
that contains any code equivalent to Q is the trivial code ; in other words, 
if -TXaZp is in the stabilizer of any code equivalent to a supercode of Q then 
a = P = 0. 

A AT-dimensional subspace of that as an error-correcting quantum code 
can protect against < d/2 errors, is called an {{n,K,d)) code. If this code is 
additive, then K = 2^, for some k, and is called an [[n, k, d]] code. The following 
theorem gives a sufficient condition for a subspace of to be an {{n,K,d)) 
code. Here wt(c) denotes the Hamming weight of the binary vector c, i.e., the 
number of 1-components in c, and a U /3 is the binary vector resulting from 
a component-wise OR operation of a and P; for example (10110) U (00101) = 
(10111). 

Theorem 1. (Q> El) Let Q be a K -dimensional subspace of . Consider 
an orthonormal basis for Q of the form { \ci) \ i = 1, . . . , AT}. Then Q is an 
((n, K, d)) code if {ci \ XaZp \ cj) = 0 for every 1 < i, j < K and for every a, P G 
{0, 1}" with 1 < wt(aU/3) < d—1. In general, a necessary and sufficient condition 
for Q to be an ((n, K, d)) code is that for all 1 < i,j < K and wt(a U /3) < d—1 
we have {ci \ XaZp \ c/f = Lcj \ X^Zp \ Cj) and if i j then {ci \ XaZp \ cf) = 0. 

□ 

For an additive code Q with stabilizer TLq there is a sufficient condition in 
term of the dual of Tdg with respect to the inner product defined by equation 
m for Q to be a t-error-correcting code. 
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Theorem 2. ( 0 , 0 ) Let Q be an additive eode with stabilizer LIq. Let H-q^ be 
the spaee orthogonal to Hq with respeet to the inner produet (0). If for every pair 
of binary vectors a, /3 G {0, 1}" with wt(aU/3) < d— 1 we have {a\j3) ^ Hq \Hq 
then Q is an [[n, fc, d\] additive code. □ 



3 Existence of Nonadditive Codes 

3.1 Quantum Codes Equivalent to Additive Codes 

We study the quantum codes equivalent to additive codes. For such code Q, we 
find a sufficient condition that guarantees that the stabilizer of Q contains a 
nontrivial operator. 

We begin with some useful notions and notations. Let |ci) , . . . , |c 2 ") be the 

standard orthonormal basis of , where each Ci is a binary vector of length n. 
2 " 

For the vector \x) = \i \ci), we define the support of jx) as 

i=l 

supp(|x)) = { Ci G {0, 1}" : Ai yf 0 } . 

Let C C {0, 1}" be a set of binary vectors. Define the vector \C) in as 

= |C|l/2 • 

' ' cGC 



(If C is empty then \C) is the zero vector.) For any binary vector a of length 
m < n, define 

C„ = {xG {0,1}’"-™: (a,x) gC}. (3) 

So to construct Ca, consider all vectors in C starting with a (if there is any), 
then delete a from these vectors. Note that Ca may be empty. 

For a quantum code Q, let us define the generalized stabilizer of Q as the set 
GS{Q) of all unitary operators V on such that V |x) = |x) for every |x) G Q. 
Then the stabilizer of Q is St(Q) = ^ n GS{Q). 

Lemma 1. Suppose that the quantum codes Q\ and Q 2 are locally equivalent via 
the transversal unitary operator IL. Then for every M G GS{Q\) the operator 
UMU^ is in GS{Q 2 ). 

Proof. Let |x) G Q 2 . Now, we know that there exists a code word \y) G Qi 
such that |x) = U\y). Since M \y) = \y), so {MW)U \y) = \y), and therefore 
[UMU^)U \y) =U\y). This implies (UMU^) |x) = |x). □ 

We are interested in the case of M G i.e., M = Mi 0 • • • ® Mn, where 
Mj G {I,ax,<Ty,az}. We define wt(M) the weight of any M G Q as the number of 
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j’s such that Mj ^ I. In this case UPlW = ui ® • • • 0 such that det(uj ) = ±1 
and if Mj = I then Vj = I, otherwise 

Vj = r]j , rjj e {1, i}, aj G K and bj G C. (4) 

If G SU(2)®" then U is of the form ui ® ® Un, where each Uj is defined by 

a matrix of the form 

/ e®“ cos 9 sin 6 \ 

\^— sin 9 cos 9 J ' 

IfM, = o’x, cTz or ay, then the corresponding Vj, respectively, is 

/ sin 29 cos(a — /3) cos^ — sin^ 
l^cos^ — sin^ — sin20cos(a — /3) y’ 

/ cos 20 — sin20e®^“+^^\ 

sin —cos 20 y’ 

/ —i sin 20 sin(a — f3) — cos^ 0e*^“ — sin^ 0e*^^\ 

l^cos^ 0e~*^“ + sin^ 0e“*^^ i sin 20 sin(a — (3) ) \ 

We call a matrix vt as full if at ■ bi 0; and we say the unitary operator 
V = ui 0 • • • 0 u„ is thin if none of t;i’s is full. In the next proof we will use this 
property that if V is thin then |supp(V |x))| = |supp(|a;))|, for every |a:). 

A quantum code Q is called real if Q has a basis consisting of real vectors; 
2 " 

i.e., if |a:) = ^ Xi \ci) is any vector in the basis, then Xi G IR, for every i. 

i—1 

An {n,K,d) binary code is a set C C {0,1}" of size K such that any two 
vectors in C differ in at least d places, and d is the largest number with this 
property. Note that an [n, k, d\ binary linear code is an (n, 2^, d) binary code. 

Theorem 3. Suppose that the quantum eodes Q\ and Q 2 are loeally equivalent 
via the transversal operator U, Q 2 is real and Q 2 eontains \C), where C is an 
(n, K, d) binary code with d > k = [log 2 K~\ . Then the following claims hold. 

(i) The image 0 / St(Qi) under the mapping M 1 -^ UMU\ which we call 
r, consists only of unitary operators of the form zLXaT, where T is a Z-type 
unitary operator of the form 



( 6 ) 



n 

T=(g) 



0 

0 



( 7 ) 



j=i ' 

(a) Let A = {a £ (0, 1}" : ±A„T G T for some T of the form Q }. Sup- 
pose that St(Q 2 ) does not contain any operator of the form zLXqZis, with /3 yf 0 . 
T/ien |St(Qi)| < |A|. □ 



The proof of this theorem can be found in m 
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We now present a criterion for nonadditiveness of quantum codes. First a 
useful notation. For a subset C of {0, 1}" let 

T{C) = {x G {0,ir : X + C C C} . 

If C is a binary linear code then T(C) = C. 

Theorem 4. Suppose that the quantum code Q of dimension 2^ is real and 
contains \C), where C is an (n, K, d) binary code with d > |"log 2 K~\ . If the identity 
operator is the only unitary operator in the stabilizer of Q and 2"“^ > |T(C)| 
then Q is nonadditive. 

Proof. Suppose, by contradiction, that Q is equivalent to additive code Q' via 
the transversal unitary operator U which maps Q' on Q. Let P be the image of 
St(Q') under U. Define A C {0, 1}" as in (ii) of Theorem 0 Then A C T{C). 
Thus 



2"-^= |St(Q')l < 1^1 < l^(C)|, 

which contradicts the assumption of the theorem. □ 

When the binary code C in the above theorem is linear we can formulate the 
theorem as follows. 

Corollary 1. Suppose that the quantum code Q of dimension 2^ is real and 
contains \C), where C is a linear [n, fc,d] code with d > k. If St{Q) = {/} and 
n > k + i then Q is nonadditive. □ 

Finally, we formulate a criterion that guarantees strongly nonadditiveness of 
quantum codes. 

Theorem 5. Suppose that the quantum code Q is real and it contains \C) where 
C is an (n,K,d) binary code with d > |"log 2 iF] . //St(Q) = {/} and GS{Q) does 
not contain any operator of the form X^T, where a yf 0 and T is of the form 
m, then Q is strongly nonadditive. 

Proof. Suppose, by contradiction, that Q C and Qi yf is equivalent to 
an additive code Q' with St(Q') yf {/}. Then, by Theorem 0 any nontrivial 
stabilizer ip of Q' defines an operator V = ui 0 • ■ • 0 in GS{Qi) C GS{Q), 
where Vj = / or it is of the form (0) or Q. If all Vj have real matrices, then 
V y^ / and V G St(Q), which is impossible. If at least one of Vj has a complex 
matrix, then V is of the form X^T with a y^ 0, which is again impossible. □ 



3.2 Construction of Nonadditive Codes 

Examples of Nonadditive Codes. Now we show that there is an infinite fam- 
ily of nonadditive quantum error-correcting codes. These codes are constructed 
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following the scheme similar to the one described in Theorem 2.4 of Con- 
sider an [n, k] binary code C such that dist(C) and dist(C-*') are both at least do 
(C needs not to be a weakly self-dual code). 

First we define a function r : C — > {0, 1}" such that for c,c' and c' 
we have t(c)-|-t(c') ^ C^. This means r(c) and r(c') are in different cosets of C**- 
in {0, 1}", for c yf d . Since there are 2^ different cosets, such mapping r always 
can be defined. 

Fix d < do, and let £ be the set of binary vectors of length n with weight 
< d — 1. Consider a subset R = { oq, oi, . . . , } of {0, 1}" such that oq = 0 

and Gj is not of the form c -I- -I- e, for c G C, 1 < i < j — 1, and e € £. Then 

the vectors 



cGC 



form a basis for a quantum code with distance d. To prove this, we show that 
{xi I XaZp I Xj) = 0, for 0 < wt(a U /3) < d. The case a yf 0 or * yf j 
is straightforward. So we only consider the case a = 0 and i = j. Then for 
0 < wt(/3) < d we have 



{Xi I Zf3 I Xi} 




cGC 




cGC 

= 0 . 



The last equality follows from the fact that dist(C'*“) > d, so /3 ^ C^. 

Lemma 2. In the above construction, suppose that 

(n-l)2'=|^7^] <2^^-C (9) 

i=0 ^ ^ 



Then it is possible to choose n linearly independent vectors oi, 02 , . . . , a„ so that 
the {{n,n+ l,d)) quantum code Q with the basis jcco) , l^i) , . . . , \xn) (each \xi) 
is defined by (Q) has trivial stabilizer, i.e., St(Q) = {/}. 

Proof. Suppose that the vectors ao,ai, . . . , a™ with the desired properties are 
chosen. Then it is possible to choose a vector Um+i such that oi, . . . ,am,CLm+i 
are independent and Om+i is not of the form c -I- -|- e (for c G C, 1 < * < m, 

d-l 

and e G f ) if 2™ -I- m • 2* • 

i=0 

n vector oi, . . . , a„ with the desired properties. 

Now we show that the identity operator is the only member of the stabilizer 
of Q. Suppose that X^Zp is in the stabilizer of Q. Since 



< 2”. This shows that it is possible to choose 



X^Zfi\xo) = Y.i-irf^\c + a) 

cGC 
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should be equal to |xo) = ^ |c) it follows that a G C and (3 G C^. Similarly, for 

cGC 

every 1 < t < n since 

X^Zp \xi) = ^(_i)-M-^+(=+“d-/3 |c + a, + a) 
cGC 

= ^(_i)^(c+«)'“.+(c+“i+«)'/3 |c + a^) 

cGC 

= ^(_i)(^(=+«)+/3)'<^. \c+ai) 

cGC 

should be equal to 

c^C 



it follows that • (r(c) + r(c + a) + (}) =0, for every 1 < i < n. Since Oi’s 
are independent, therefore r(c) + r(c + a) = /3 G hence a = 0. Now the 
conditions Oi • /3 = 0 (for 1 < j < n) imply /3 = 0. □ 



Theorem 6. Suppose that C is an [n,k,do\ binary linear code such that do > k 
and dist{C) and dist{C^) are at least d. Moreover, suppose that n, k and d satisfy 
(0. Let £ be the greatest integer such that 2^ < ("). Suppose that 

k + £ <n. Then there is a an {{n,2^,d)) nonadditive code. 

Proof. Consider the ((n,n + l,<i)) code Qo constructed in the previous lemma. 
Then by Theorem 4.2 of it is possible to add at least 2^ — {n+ 1) more vectors 
to Qo to build an ((n,2^,d)) code Q, which is, by Corollary nonadditive. □ 

As an application we show that there are {{n, [2"“^/(n+ 1)J , 2)) nonadditive 
codes, for every n > 8. Consider the [n, l,n] binary code C = {0,1}. Then 
is consists of all even weight vectors in {0,1}"', so it is an [n,n— 1,2] code. The 
condition m satisfies if n > 8. Then by applying the above theorem (for k = 1 
and £ = [n — 1 — log 2 (n + 1)]) we get the desired code. Other classes of binary 
codes for which the minimum distance of the code and its dual are known (such 
as Hamming codes and Reed-Muller codes) can be used to get nonadditive codes 
with different parameters. 

Finally, we show that the nonadditive codes are almost as good as Calderbank- 
Shor-Steane (CSS) codes, at least in the case that the dimension of code is large 
enough. 

To utilize the CSS codes for constructing nonadditive codes, we must modify 
them such that the new codes have trivial stabilizers. Let Q be an [[n, n — 2k, dj] 
CCS code based on the weakly self-dual [n, k] code C with dist(C-*-) > d. Consider 
the basis for Q consisting of vectors of the form |a:a) = E |c + a), for a G jC. 
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Also consider the function r: C — > {0,1}" defined at the beginning of this 
section. We define the quantum code Q with basis 

= ( 10 ) 

cGC 



for a e jC. Then it is easy to check that Q is also an [[n, n — 2k, d]] code. 

Theorem 7. Suppose that C is an [n, k, do] weakly self-dual binary code, and 
C'*' is an [n,n — fc, di] code. Assume do > k and > n — k—1 (for example 

it is enough to have k < (n — log 2 n)!2). For any d < d\ that staisfies 

(2"-'= + {k- 1)2'=) XI (11) 

i=o 

we have an ((n, 2"“^'=, d)) nonadditive code. 

Proof. Let Qo be the [[n,n — 2fc,dj] CSS code based on C, and let Qo be the 
quantum code obtained from Qq as described in the preceding procedure. We 
can choose independent vectors ai,... ,an-k in such that afs belong to 
different cosets of C in . This is possible because 2"“^'=^ > n — fc — 1. We 
consider \yaf) , ■ • ■ , \ya„-k) (defined by (CDJ) as vectors in Qo- Then we choose 
vectors On-k+i, ■ • ■ , such that oi, . . . , o„ are n independent vectors, and Q' = 
Qo U {|a:a„_fc+i) , ■ • ■ , |a^a„)}, is an ((n,2”“^'= + k,d)) code. The inequality (im 
implies that it is possible to choose On-k+i, ■ • ■ ,On with the desired properties. 
Then the proof of Lemma El shows that St(Q') = {/} 

Let Q be the quantum code obtained from Q' by removing any k vectors 
except |yai), i = 1,... ,n. Then St(Q) = {/} (because Q contains the lya,), 
i = I, . . . ,n). So, by Corollary ^ with £ = n — 2k, Q is nonadditive. □ 

To show that there are weakly self-dual codes C that satisfy the requirements 
of the above theorem, it is possible to apply the greedy method used in classical 
coding theory (see m, Chap. 17). The same method is used in 0 to prove 
the existence of CSS codes meeting the Gilbert-Varshamov bound. This method 
gives the following bound. 

Theorem 8. For d < Xn, where A = FFf ^ {FI {1/2)), there are nonadditive 
{{n,2^ , d)) quantum codes with rate k/n > 1 — 2 F£ 2 {d/n) . □ 



A Strongly Nonadditive Code. In this section we provide an example of 
a strongly nonadditive quantum error-correcting code. This is an ((11,2,3)) 
strongly nonadditive code. 

Consider the (Paley type) Hadamard matrix of order 12 (see, e.g., [II3], p. 
48). Delete the all-1 column and replace —1 by 1 and +1 by 0. The result is the 
following matrix 
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00000000000 
10100011101 
11010001110 
01101000111 
10110100011 
11011010001 
^ ~ 11101101000 ■ 

01110110100 

00111011010 

00011101101 

10001110110 

01000111011 

We denote the row of H by The set C = { : 1 < j < 12 } is an (11, 12, 6) 

code. Then a basis for the desired quantum code consists of the following two 
vectors: 

12 

12 

|1l) = ^|l + ri). 



where 1 is the all-1 vector of length 11. We claim these vectors are basis for an 
((11,2,3)) quantum code. We have to show that 

{Ol\Xo,Z0 |0l) = 0, 

(U I I U) = 0, 

(Oi I x^Zp I U) = 0, 

for every a, P G {0, 1}^^ such that 1 < wt(a U /3) < 
distance of any two distinct vectors in the set 

{r*:l<j<12}U{l + ri:l<i<12} 

is at least 5. Thus if 1 < wt(a) < 4 then all conditions (I12I -(I14| 1 hold. Now 
suppose that a = 0. Then (II 411 trivially holds. To see that 1121) and (II 311 hold it 
is enough to note that if 1 < wt(/3) < 2 then • /3 = 1 for exactly 6 values of i. 
This completes the proof that {|0i),|lL)}isa basis for an ((11,2,3)) quantum 
error-correcting code. 

To show that this code is nonadditive, let (p = {—l)^XaZi 3 be any operator 
in the stabilizer of this code. Since (p\0l) = |0l) and p |ri) = |a), hence A = 0 
and a should be one of r^’s. Then we should have a = ri = 0, because for every 
Vi, i ^ 1, there is some j such that is not equal to any r^. Therefore, 

p = Zfj. Then 

12 12 

lU) = 5](-l)(i+’'‘)-'^ |1 + r,) = ^ |1 + n) 

2=1 2=1 



( 12 ) 

(13) 

(14) 

2. First note that that the 
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implies that (1 + r^) • /3 = 0, for every i. But the set { 1 + : 1 < i < 12 } has 

rank 11, so f3 = 0. This shows that the identity operator is the only operator 
in the stabilizer of this code. Finally, suppose that X^T is in the generalized 
stabilizer of this code, where the operator T is of the form O- Note that the 
operator T only affects the phases of the states, so the above argument also 
implies a = 0. Now Theorem implies that this code is strongly nonadditive. 

4 Concluding Remarks 

We showed that there are nonadditive codes with different minimum distances. 
We also showed that nonadditive codes that correct t errors can reach the asymp- 
totic rate R> 1 — 2H2{2t/n). We introduced the notion of strongly nonadditive 
codes, and gave an example of such codes. It would be interesting to find more 
examples of such codes. We conjecture that the nonadditive codes constructed 
in Section E3 are also strongly nonadditive codes. 

Recently we have improved the construction method for nonadditive quna- 
tum codes. With this new scheme, we are now able to give explicit construc- 
tions of nonadditive ((2m, i2^’”,2)) and strongly nonadditive ((2m -I- 1, |(1 — 
^) 22 m-i-i, 2 )) codes. Also we have improved the asymptotic Gilbert- Varshamov 
bound for the rate of nonadditive codes. The new bound, which is for strongly 
nonadditive codes, is the same as the bound for additive codes 0, i.e., R > 
1 — H2{2t/n) — (2t/n) log 2 3. All these results will appear in the final version of 
this paper PI- 
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Abstract. All the qnantum error correction schemes devised to snppress 
independent decoherence are shown to be applicable for reducing general 
spatially correlated decoherence. 
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The condition of independent decoherence plays an important role in devising 
the quantum error correcting codes (QEGGs) [1-3]. However, in practice, spatial 
correlation may take place in decoherence of qubits [4-6] . Are the QEGGs devised 
to correct single-qubit errors are applicable for reducing spatially correlated deco- 
herence? We calculate the state fidelity after applying the conventional quantum 
error correction procedure in the case of spatially correlated decoherence, and 
show that the QEGGs are still valid. This extends the decoherence model in the 
previous quantum error correction schemes to more realistic circumstances. 

We take the analysis of spatially correlated phase damping as an example to 
illustrate the general result. It is not difficult to derive that the master equation 
for spatially correlated phase damping has the form [7] 

(^) = I E (7b [ 2 <P it) {t) - p {t) a]a^] } , ( 1 ) 

i,i=l 

where p (<) is the reduced density of the qubits in the interaction picture, and 
trf are all Pauli’s operators describing the qubits. The coefficients 7 ^, whose 
values depend on the correlations of the bath operators and the separations 
of the qubits, completely characterize the spatial correlation properties. L is 
the number of qubits and e is a normalization constant to make 'jij satisfy the 

L 

normalization condition ^ 7 ^^ = 1. Define an L x L spatial correlation matrix 

i=l 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 337-ESl 1999- 
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r hy r = [yij\. -T is a real symmetric matrix, thus it can be diagonalized by an 
orthogonal matrix O as OrO~^ = diag (71, 72 , • • • , 7l) , where 71, 72, • • • , and 7^, 

L 

satisfying X) 7* = ^he eigenvalues of the matrix F. Define the operators 

i —1 

sf by the equation (sf , s|, • • • , s|) = (erf , erf , • • • , ct£) 0+. The master equation 
(3) is then rewritten as 

d ^ 

—p{t) = -iHeffp{t) + ip{t)Hl^^+ eyis^p{t)s^, (2) 

where the non-Hermitian effective Hamiltonian is 

^e// = -f • (3) 

1=1 

In the language of quantum trajectories [8,9], the system evolution described 
by Eq. (2) is represented by an ensemble of wave functions that propagate accord- 
ing to the effective Hamiltonian H^ff, interrupted at random times by quantum 
jumps. In a finite time Tq, we perform N times error correction. The state of 
the qubits, initially denoted by \F (0)), evolves in the time interval At = Tq/N 
according to Eq. (2). Suppose that e = max(e7iZ\t) << 1. Up to order e, the 
normalized state of the qubits after At will be either 



i^t)) 







(4) 



with probability Pi = eyiAt {F (0)| (sf)^ \F (0)) in case of a jump in decay channel 
I at a random time in the interval At, or 



1^0 {At)) = exp {-iHeffAt) \F (0)) 



(5) 



with probability po 



1=1 



1 — eZ\t ('?' (0)1 X 7; 1^ (0)) ~ o (e^) if no jump 



L 

occurred. Let s§ =X 
1=1 




which represents the first order error due to 



the effective evolution, whereas sf (/ = 1, 2, • • • , L) can be explained as the first 
order errors due to quantum jumps. For independent decoherence, sf and sf 
reduce to the single qubit operators / and erf, respectively. But for general 
spatially correlated decoherence, the errors sg and sf do not have this desired 
property. Nevertheless, here we show, that the QECCs devised to correct single- 
qubit errors are still valid in general circumstances, and moreover, the encoding, 
decoding , and error correction operations remain the same as those in the case 
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of independent decoherence. In the QECCs, phase errors are represented by the 
operators af (/ = 1, 2, • • • , L). For simplicity of symbols, let ag = I, where I is 
the unit operator, corresponding that there is no error. We consider orthogonal 
codes [10]. Most of the discovered codes belong to this class. For any orthogonal 
phase error correcting codes, the encoded input state j'f' (0)) satisfies [10] 

(tf^(0)|afa||^^(0))=% (6) 

where (i, j = 0, 1, • • • , L). 

During the error correction procedure, we first detect the error syndrome. 
If there is a af {i = 0,1, ■ ■ ■ , L) error ( a CTq error in fact means that there is 
no error.), i.e., the state becomes erf ['f'(O)), we apply the recovery operator 
erf to the state and thus get the initial encoded state |'f'(0)). In the case of 
spatially correlated decoherence, phase errors are represented not by af , but by 
sf instead. However, we still adopt the above error correction procedure. If there 
is a sf (i = 0, 1, • • • , L) error, which occurs with probability pi, we detect the error 
syndrome, and with probability [('?' (0)| cr| [>fi (2\t))| find that the error is cr|. 
After this detection, the state is collapsed into j'f' (0)). Then we apply the 
recovery operator cr| and get the initial state (0)). The whole error correction 
procedure described above yields the following average state fidelity 

=E E K K'Z' (0)1 a| [<f^, (Z\t))|" 

i=0i=0 f-j-j 

= l-o(e^) . 

L 

In deriving Fq. (7), we used Fq. (6) and the identity E = 1- After the 

i=i 

whole time Tq, the final average fidelity Ff (Tq) is approximately 

F,{To)^[F,{At)f ^l-o{Ne^). (8) 

Since e = max(£ 7 ;Z\t) oc ■^, Ne^ can be made arbitrarily small by a frequent 
repetition of the error correction procedure. This completes the proof that the 
conventional QFCCs are valid for reducing spatially correlated decoherence. 
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Abstract. Following a suggestion of A. Kitaev, we explore the connec- 
tion between fault-tolerant quantum computation and nonabelian quan- 
tum statistics in two spatial dimensions. A suitably designed spin system 
can support localized excitations (quasiparticles) that exhibit long-range 
nonabelian Aharonov-Bohm interactions. Quantum information encoded 
in the charges of the quasiparticles is highly resistant to decoherence, and 
can be reliably processed by carrying one quasiparticle around another. 
If information is encoded in pairs of quasiparticles, then the Aharonov- 
Bohm interactions can be adequate for universal fault-tolerant quantum 
computation. 



1 Fault-Tolerant Quantum Computation 

Quantum computers appear to be capable, at least in principle, of solving certain 
problems far faster than any conceivable classical computer fP-^]. In practice, 
though, quantum computing technology is still in its infancy. While a practical 
and useful quantum computer may eventually be constructed, we cannot clearly 
envision at present what the hardware of that machine will be like. Nevertheless, 
we can be quite confident that any practical quantum computer will incorporate 
some type of error correction into its operation. Quantum computers are far 
more susceptible to making errors than conventional digital computers 0-|S|, 
and some method of controlling and correcting those errors will be needed to 
prevent a quantum computer from crashing. 

The future prospects for quantum computing received a tremendous boost 
from the discovery by Peter ShorjO| and Andrew Steanejmm that quantum 
error correction is really possible in principle. But this discovery in itself is 
not sufficient to ensure that a noisy quantum computer can perform reliably. 
To carry out a quantum error-correction protocol, we must first encode the 
quantum information we want to protect, and then repeatedly perform recovery 
operations that reverse the errors that accumulate. Since encoding and recovery 
are themselves complex quantum computations, errors will inevitably occur while 
we perform these operations. Thus, we need to find methods for recovering from 
errors that are sufficiently robust to succeed with high reliability even when we 
make some errors during the recovery step. Such fault-tolerant recovery methods 
were first developed by ShorfEj and Alexei Kitaev^^l; these methods were later 
generalized and improved by Shor and David DiVincenzo|l4j. and by SteanefTHj. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 341-^^^ 1999. 
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Furthermore, to operate a quantum computer, we must do more than just 
store quantum information; we must process the information. We need to be able 
to perform quantum gates, in which two or more encoded qubits come together 
and interact with one another. If an error occurs in one qubit, and then that 
qubit interacts with another through the operation of a quantum gate, the error 
is likely to spread to the second qubit. We must design our gates to minimize 
the propagation of error. The central challenge is to construct a universal set 
of quantum gates that can act on the encoded data blocks without introduc- 
ing an excessive number of errors. Such a scheme for fault-tolerant quantum 
computation was first developed by Shor ll2l and later generalized by Daniel 
Gottesmanlltil. 

Once the elementary gates of our quantum computer are sufficiently reliable, 
we can perform fault-tolerant quantum gates on encoded information, along with 
fault-tolerant error recovery, to improve the reliability of the device. But for any 
fixed quantum code, or even for most infinite classes that contain codes of arbi- 
trarily large block size, these procedures will eventually fail if we attempt a very 
long computation. However, there is a special class of codes {concatenated codes) 
which enable us to perform longer and longer quantum computations reliably, 
as we increase the block size at a modest rate^Jj-f^- Invoking concatenated 
codes we can establish an accuracy threshold for quantum computation; once 
our hardware meets a specified standard of accuracy, quantum error-correcting 
codes and fault-tolerant procedures enable us to perform arbitrarily long quan- 
tum computations with arbitrarily high reliability. 

With the development of fault-tolerant methods, we now know that it is pos- 
sible in principle for the operator of a quantum computer to actively intervene 
to stabilize the device against errors in a noisy (but not too noisy) environment. 
In the long term, though, fault tolerance might be achieved in practical quantum 
computers by a rather different route — with intrinsically fault-tolerant hardware. 
Such hardware, designed to be impervious to localized influences, could be oper- 
ated relatively carelessly, yet could still store and process quantum information 
robustly. 

In this paper, we explore a scheme for fault-tolerant hardware envisioned 
by Kitaevpi). in which the quantum gates exploit nonabelian Aharonov-Bohm 
interactions among distantly separated quasiparticles in a suitably constructed 
two-dimensional spin system. Though the laboratory implementation of Kitaev’s 
idea may be far in the future, his work offers a new slant on quantum fault 
tolerance that shuns the analysis of abstract quantum circuits, in favor of new 
physics principles that might be exploited in the reliable processing of quantum 
information. 

We explain in §2 that charges participating in long-range Aharonov-Bohm 
phenomena are impervious to local disturbances, so that quantum information 
encoded in such charges is robust. In §3 we argue that nonabelian Aharonov- 
Bohm interactions among quasiparticles arise in a class of two-dimensional spin 
systems. These interactions are discussed in detail in §4; we see that the exchange 
of two quasiparticles can modify the charges carried by the particles; thus par- 
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tides with different charges may actually be indistinguishable. In particular, a 
quasiparticle that carries a superposition of two different charges need not deco- 
here, because the local environment is indifferent to the value of the charge. In 
§5 we sketch our main result: that nonabelian Aharonov-Bohm interactions are 
adequate for universal quantum computation, in a model with a sufficiently rich 
group-theoretic structure. We conclude in §6 with some tentative speculations 
regarding the implications of quantum fault tolerance for fundamental physics. 

Recent claims about the potential for the fault-tolerant manipulation of com- 
plex quantum states may seem grandiose from the perspective of present-day 
technology. Surely, we have far to go before devices are constructed that can, 
say, exploit the accuracy threshold for quantum computation]^. Nevertheless, 
we feel strongly that recent work relating to quantum error correction will have 
an enduring legacy. Theoretical quantum computation has developed at a spec- 
tacular pace over the past three years. If, as appears to be the case, the quantum 
classification of computational complexity differs from the classical classification, 
then no conceivable classical computer can accurately predict the behavior of 
even a modest number of qubits (of order 100). Perhaps, then, relatively small 
quantum systems will have far greater potential than we now suspect to surprise, 
baffle, and delight us. Yet this potential could never be realized were we unable 
to protect such systems from the destructive effects of noise and decoherence. 
Thus the discovery of fault-tolerant methods for quantum error recovery and 
quantum computation has exceptionally deep implications, both for the future 
of experimental physics and for the future of technology. The theoretical ad- 
vances have illuminated the path toward a future in which intricate quantum 
systems may be persuaded to do our bidding. 

2 Aharonov-Bohm Phenomena and Snperselection Rules 

Topological concepts have a natural place in the discussion of quantum error 
correction and fault-tolerant computation. Topology concerns the “global” prop- 
erties of an object that remain unchanged when we deform the object locally. 
The central idea of quantum error correction is to store and manipulate quan- 
tum information in a “global” form that is resistant to local disturbances. A 
fault-tolerant gate should be designed to act on this global information, so that 
the action it performs on the encoded data remains unchanged even if we deform 
the gate slightly; that is, even if the implementation of the gate is not perfect. 

In seeking physical implementations of fault-tolerant quantum computation, 
then, we ask whether there are known systems in which physical interactions 
have a topological character. Indeed, topology is at the essence of the Aharonov- 
Bohm effect. If an electron is transported around a perfectly shielded magnetic 
solenoid, its wave function acquires a phase where e is the electron charge 
and <l> is the magnetic fiux enclosed by the solenoid. This Aharonov-Bohm phase 
is a topological property of the path traversed by the electron — it depends only 
on how many times the electron circumnavigates the solenoid, and is unchanged 
when the path is smoothly deformed. (See Fig.^) We are thus led to contemplate 
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a realization of quantum computation in which information is encoded in a form 
that can be measured and manipulated through Aharonov-Bohm interactions — 
topological interactions that are immune to local disturbances. 



Fig. 1. A topological interaction. The Aharonov-Bohm phase acquired by an 
electron that encircles a flux tube remains unchanged if the electron’s path is 
slightly deformed. 



It is useful to reexpress this reasoning in the language of superselection rules. 
A superselection rule, as we are using the term here, arises (in a held theory or 
spin system defined in an infinite spatial volume) if Hilbert space decomposes 
into mutually orthogonal sectors, where each sector is preserved by any local 
operation. Perhaps the most familiar example is the charge superselection rule 
in quantum electrodynamics. An electric charge has an infinite range electric 
held. Therefore no local action can create or destroy a charge, for to destroy a 
charge we must also destroy the electric held lines extending to infinity, and no 
local procedure can accomplish this task. 

The Aharonov-Bohm interaction is also an infinite range effect; the electron 
acquires an Aharonov-Bohm phase upon circling the solenoid no matter what its 
distance from the solenoid. So we may infer that no local operation can destroy 
a charge that participates in Aharonov-Bohm phenomena. If we consider two 
objects carrying such charges, widely separated and well isolated from other 
charged objects, then any process that changes the charge on either object would 
have to act coherently in the whole region containing the two charges. Thus, the 
charges are quite robust in the presence of localized disturbances; we can strike 
the particle with a hammer or otherwise abuse it without modifying the charges 
that it carries. 

Following Kitaev|25i we may envision a topological quantum computer, a de- 
vice in which quantum information is encoded in the quantum numbers carried 
by quasiparticles that reside on a two-dimensional surface and have long-range 
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Aharonov-Bohm interactions with one another. At zero temperature, an acciden- 
tal exchange of quantum numbers between quasiparticles (an error) arises only 
due to quantum tunneling phenomena involving the virtual exchange of charged 
objects. The amplitude for such processes is of the order of where m is 

the mass of the lightest charged object (in natural units), and L is the distance 
between the two quasiparticles. If the quasiparticles are kept far apart, the prob- 
ability of an error afflicting the encoded information will be extremely low. At 
finite temperature T, there is an additional source of error, because an uncon- 
trolled plasma of charged particles will inevitably be present, with a density 
proportional to the Boltzman factor where A is the mass gap (not neces- 

sarily equal to the “curvature mass” m). Sometimes one of the plasma particles 
will slip unnoticed between two of our data-carrying particles, resulting in an 
exchange of charge and hence an error. To achieve an acceptably low error rate, 
then, we would need to keep the temperature well below the gap A (or else we 
would have to monitor the thermal plasma very faithfully) . 

3 The Fractional Quantum Hall Effect and Beyond 

If our device is to be capable of performing interesting computations, the 
Aharonov-Bohm phenomena that it employs must be nonahelian. Only then will 
we be able to build up complex unitary transformations by performing many par- 
ticle exchanges in succession. Such nonabelian Aharonov-Bohm effects can arise 
in systems with nonabelian gauge fields. Nature has been kind enough to provide 
us with some fundamental nonabelian gauge fields, but unfortunately not very 
many, and none of these seem to be suited for practical quantum computation. 
To realize Kitaev’s vision, then, we must hope that nonabelian Aharonov-Bohm 
effects can arise as complex collective phenomena in (two-dimensional electron 
or spin) systems that have only short-range fundamental interactions. 

In fact, one of the most remarkable discoveries of recent decades has been that 
infinite range Aharonov-Bohm phenomena can arise in such systems, as revealed 
by the observation of the fractional quantum Hall effect. The electrons in quan- 
tum Hall systems are so highly frustrated that the ground state is an extremely 
entangled state with strong quantum correlations extending out over large dis- 
tances. Hence, when one quasiparticle is transported around another, even when 
the quasiparticles are widely separated, the many electron wave function acquires 
a nontrivial Berry phase (such as This Berry phase is indistinguishable 

in all its observable effects from an Aharonov-Bohm phase arising from a funda- 
mental gauge field, and its experimental consequences are spectacular [201 • 

The Berry phases observed in quantum Hall systems are abelian (although 
there are some strong indications that nonabelian Berry phases can occur under 
the right conditions |EZEE|), and so are not very interesting from the viewpoint 
of quantum computation. But Kitaevj23] has described a family of simple spin 
systems with local interactions in which the existence of quasiparticles with non- 
abelian Berry phases can be demonstrated. (The Hamiltonian of the system so 
frustrates the spins that the ground state is a highly entangled state with infinite 
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range quantum correlations.) These models are sufficiently simple (although un- 
fortunately they require four-body interactions), that one can imagine a designer 
material that can be reasonably well-described by one of Kitaev’s models. The 
crucial topological properties of the model are relatively insensitive to the precise 
microscopic details, so the task of the fabricator who “trims” the material may 
not be overly demanding. If furthermore it were possible to control the trans- 
port of individual quasiparticles (perhaps with a suitable magnetic tweezers), 
then the system could be operated as a fault-tolerant quantum computer. 




























































Fig. 2. A Kitaev spin model. Spins reside on the lattice links. The four spins 
that meet at a site or share a plaquette are coupled. 



To construct his models, Kitaev considers a square lattice, with spins re- 
siding on each lattice link. The Hamiltonian is expressed as a sum of mutually 
commuting four-body operators, one for each site and one for each plaquette 
of the lattice. (See Fig. |3) Because the terms are mutually commuting, it is 
simple to diagonalize the Hamiltonian by diagonalizing each term separately. 
The operators on sites resemble local gauge symmetries (acting independently 
at each site), and a state that minimizes these terms is invariant under the local 
symmetry, like the physical states that obey Gauss’s law in a gauge theory. The 
operators on plaquettes are like “magnetic flux” operators in a gauge theory, 
and these terms are minimized when the magnetic flux vanishes everywhere. 
The excitation spectrum includes states in which Gauss’s law is violated at iso- 
lated sites — these points are “electrically charged” quasiparticles — and states 
in which the magnetic flux is nonvanishing at isolated plaquettes — these are 
magnetic fluxon quasiparticles. The quantum entanglement of the ground state 
is such that a nontrivial Berry phase is associated with the transport of a charge 
around a flux — this phase is identical to the Aharonov-Bohm phase in the 
analog gauge theory. 

These Aharonov-Bohm phenomena are stable even as we deform the Hamil- 
tonian of the theory. Indeed, if the deformation is sufficiently small, we can study 
its effects using perturbation theory. But as long as the perturbations are local in 
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space, topological effects are robust, since perturbation theory is just a sum over 
localized influences. Whatever destroys the long-range topological interactions 
must be nonperturbative in the deformation of the theory. 

Two types of nonperturbative effects can be anticipated PHI- The ground 
state of the theory might become a “flux condensate” with an indefinite number 
of magnetic excitations. In this event, there would be a long-range attractive 
interaction between charged particles and their antiparticles. It would be impos- 
sible to separate charges, and there would be no long-range effects. In a gauge 
theory, this phenomenon would be called electric confinement. Alternatively, a 
condensate of electric quasiparticles might appear in the ground state. Then the 
magnetic excitations would be confined, and again the long-range Aharonov- 
Bohm effects would be destroyed. In a gauge theory, we would call this the 
Higgs phenomenon (or magnetic confinement). 

Thus, as we deform Kitaev’s Hamiltonian, we can anticipate that a phase 
boundary will eventually be encountered, beyond which either electric confine- 
ment or the Higgs phenomenon will occur. The size of the region enclosed by this 
boundary will determine how precisely a material will need to be fabricated in 
order to behave as Kitaev specifies. A particularly urgent question for the mate- 
rial designer is whether cleverly chosen two-body interactions might so frustrate 
a spin system as to produce a highly entangled ground state and nonabelian 
Aharonov-Bohm interactions among the quasiparticle excitations. 

The fractional quantum Hall effect, and Kitaev’s models, speak a memo- 
rable lesson. We And gauge phenomena emerging as collective effects in systems 
with only short range interactions. It is intriguing to speculate that the gauge 
symmetries known in Nature could have a similar origin. 



4 Topological Interactions 

As we have noted, in Kitaev’s spin models, there are two types of charges that 
can be carried by localized quasiparticles, which we may call “electric” and 
“magnetic” charges. In the simplest type of model, the “magnetic flux” carried 
by a particle can be labeled by an element of a finite group G, and “electric 
charges” are labeled by irreducible representation^ of G. If a charged particle in 
the irreducible representation whose quantum numbers are encoded in an 

internal wavefunction is carried around a flux labeled by group element 

u € G, then the wavefunction is modified according to 

. ( 1 ) 

Exploiting this interaction, we can measure a magnetic flux by scattering a 
suitable charged particle off of the fluxj3I]|. For example, we could construct 
a Mach-Zender flux interferometer as shown in Fig. 01 that is sensitive to the 

^ There can also be “dyons” that carry both types of charge, and the classification of 
the charge carried by a dyon is somewhat subtle, but we will not need to discuss 
explicitly the properties of the dyons. 
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relative phase acquired by the charged particle paths that pass to the left or 
right of the flux. If we balance the interferometer properly, we can distinguish 
between, say, two flux values ui,U2 G G; a ui flux will be detected emerging 
from one arm of the interferometer, and a U2 flux from the other arm. Of course, 
the interferometer we build will not be flawless, but the flux measurement can 
nevertheless be fault-tolerant — if we have many charged projectiles and perform 
the measurement repeatedly, we can determine the flux with very high statistical 
confidence. 




Fig. 3. A Mach-Zender interferometer for flux measurement, shown schemati- 
cally. The flux to be measured is inserted inside. The test charge emerges from 
one arm if the flux has value ui, the other arm if the flux has value U2- 



If the two fluxes ui and U2 belong to the same conjugacy class in G, then there 
is a symmetry relating the two fluxons, so that all local physics is indifferent to 
the value of the flux (see below) . Therefore, a coherent superposition of fluxes 

a\ui) + b\u2) (2) 

will not readily decohere due to localized interactions with the environment. But 
the flux interferometer (operated repeatedly) will project the fluxon onto either 
of the flux eigenstates |mi) (with probability |ap) or \u2) (with probability | 6 p). 

Now imagine that two fluxons have been carefully calibrated, so that one is 
known to carry the flux u\ and the other the flux U2- And suppose that the 
two vortices are carefully “exchanged” by carrying the first around the second 
as shown in Fig. 0 and that we subsequently remeasure the fluxes. Carrying a 
charged particle around the fluxon on the right, after the exchange, is topolog- 
ically equivalent to carrying the charged particle around first the right fluxon, 
then the left fluxon, and Anally the right fluxon in the opposite direction, before 
the exchange. We infer that the exchange modifies the quantum numbers of the 
fluxons according to 
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Fig. 4. The flux exchange interaction. The flux labeled ui is carried from its 
original position (shaded) to its new position (unshaded), and then remeasured. 
The charged particle path shown that encircles the original position of the flux 
is topologically equivalent to a path that encircles the new position; hence the 
value of the flux changes from ui to u'^ = U2^U\U2- 

\U1)\U2) \u2)\u2^UiU2) , (3) 

a nontrivial interaction if the two fluxes fail to commute j, 3 1 j. Thus, noncommut- 
ing fluxes have interesting Aharonov-Bohm interactions of their own, even in the 
absence of any electric charges. Because carrying one flux around another can 
conjugate the value of the flux, two fluxons carrying conjugate fluxes must be 
regarded as indistinguishable na.rticles j,32j. An exchange of two such objects can 
modify their internal quantum numbers; we will refer to them as nonabelions\lVA\ . 
indistinguishable particles in two dimensions that obey an exotic nonabelian 
variant of quantum statistics. 

We will use the exchange interaction Eq. as a fundamental logical oper- 
ation in our Aharonov-Bohm quantum computer. However, it will actually be 
convenient to encode qubits in pairs of fluxons, where the total flux of the pair is 
trivial | 21 ]. That is, we will consider fluxon-antifluxon pairs of the form |u, u ^), 
but where the flux and antiflux are kept far enough apart from one another 
that an inadvertent exchange of quantum numbers between them is unlikely. To 
perform logic, we may pull one pair through another as shown in Fig. 0 . Since 
the total flux that passes through the middle of the outside pair is trivial, this 
pair is not modified, but the inside fluxes are conjugated by the outside flux: 

\ui,ujj^)\u2,U2^) \u2,U2^)\u2'^UiU2,U2'^ujj'^U2) (4) 

an operation that is evidently isomorphic to the effect of the exchange of single 
fluxes described by Eq. (|nj. Using pairs instead of single fluxons has two ad- 
vantages. First, since each pair has trivial total flux, the pairs do not interact 
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Fig. 5. The “pull-through” interaction. One flux pair is pulled through another. 
The outside flux is unmodifled, but the inside flux is conjugated by the outside 
flux. 

unless one is pulled through another; therefore, we can easily shunt pairs around 
the device without inducing any unwanted interactions with distant pairs. Sec- 
ond, and more important, pairs can carry charges even if each member of the 
pair carries no charge |^4I35| . The charge of a pair can be measured, and this 
charge-measurement operation will be a crucial ingredient in the construction 
of a universal set of quantum gates. The operation Eq. © can be regarded as 
a classical logic gate; it takes flux eigenstates to flux eigenstates. To perform 
interesting quantum computations, we will need to be able to prepare coherent 
superpositions of flux eigenstates. This is what we can accomplish by measuring 
the charge of a pair. 




O • • 

NOT 

pair 



Fig. 6. The NOT gate. Pulling a computational flux pair through a NOT pair 
flips the value of the encoded bit. 

Suppose that uq and ui G G are related by ui = v~^uqv for some v G G. Then 
if we think of the flux eigenstates |uo, and |ui, as computational basis 
states, the effect of pulling either pair through a pair can be interpreted 

as a NOT (or ax) gate: 






( 5 ) 
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Fig. 7. A Mach-Zender interferometer for charge measurement, shown schemat- 
ically. The flux pair whose charge is to be measured is inserted inside. If the test 
NOT flux emerges from one arm, the |-|-) charge state has been prepared; if it 
emerges from the other arm, |— ) has been prepared. 

(see Fig. El). But suppose we wish to prepare one of the states 



We can project a coherent superposition of |uo, and |ui, onto the {|±)} 
basis by scattering a |u) fluxon off the pair, or in other words by operating a 
charge interferometer, as in Fig. 0 When the |u) fluxon navigates around the 
pair, it acquires a trivial Aharonov-Bohm phase if the pair is in the state |-|-) 
and the nontrivial phase —1 if the pair is in the state |— ). If the interferometer 
is properly balanced, then, the |u) projectile will be detected emerging from one 
arm of the interferometer if the pair is |-|-), and the other arm if the pair is |— ). 
This is an example of charge measurement. Though the interferometer will not 
be perfect, charge measurement (like flux measurement) can be fault-tolerant, if 
we repeat the measurement enough times. 

5 Universal Topological Computation 

Working with fluxon pairs as computational basis states, we have seen how to 
perform the exchange (or “pull through”) operation Eq. Q), how to measure 
flux (using previously calibrated charges), and how to measure charge (using 
previously calibrated fluxes). We will also suppose that we are able to produce a 
large supply of vortex pairs. Local processes produce pairs that carry no charge 
or flux; a charge-zero pair with trivial flux has the form (up to normalization) 




(6) 



[charge zero) = ^ \u,u 



(7) 



U 
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where the sum ranges over a complete conjugacy class of G. Because this state 
is left invariant when conjugated by any element of G, it has trivial Aharonov- 
Bohm interactions with any flux, and so carries no detectable charge. After 
producing such a pair, we can perform flux measurement to project out one 
of the flux eigenstate pairs Performing many such measurements on 

many pairs, we can assemble a large reservoir of calibrated flux pairs that can 
be withdrawn as needed during the course of a computation. 

But is our quantum computer universal — can we closely approximate any 
desired unitary transformation? To address this issue, we appeal to a theorem 
proved by Cottesma.njl ti|. Suppose that we can perform any classical reversible 
operation; that is, any unitary transformation on n qubits that merely permutes 
the 2" computational basis states. Then to achieve universal quantum computa- 
tion, it is sufficient to be able to perform a few simple operations on individual 
qubits: the single-qubit gate and measurement of the single-qubit observables 
(Jx, <Jy, and az - In other words (if we envision the qubits as spin-^ objects), once 
we have a universal classical gate at our disposal, we can build a universal quan- 
tum computer if we are able to rotate a spin by 180° about the z axis 0 and can 
measure the spin along the x, y, and z axes. 

In fact, there are groups G such that the operation Eq. (gj) is sufficient for 
universal classical computation. The three-bit Toffoli gate, with action 

Toffoli : |a, 6 , c) |a, 6 , c © ab) ( 8 ) 

on a,b,c G {0, 1}, is a universal classical gate. We have found that a Toffoli gate 
can be constructed from Eq. O if G = A 5 , the group of even permutations on 
five objects. We may, for example, choose computational basis states with 

uo = (125) , ui = (234) ; (9) 

that is, we choose our computational fluxes to be three-cycles with one object 
in common. Then a Toffoli gate can be constructed from a total of 16 elemen- 
tary “pull-through” operations; six ancilla pairs are also used to catalyze this 
reaction. No Toffoli gate was found in any group smaller than AsS Since A 5 is 
also the smallest of the finite nonsol vable groups, it is tempting to conjecture 
that nonsolvablility is a necessary condition for universal classical computation 
generated by conjugation .0 

We have already remarked that an ax gate can be realized by pulling a com- 
putational vortex pair through the pair with flux v such that ui = v~^uqV] here 
we choose v = (14) (35). It turns out that the CTz gate can be constructed with 

^ Since is a classical gate, and ioy = a^ax, it follows that we can perform 180° 
rotations abont each of the x, y and z axes. 

® Kitaev had reported earlier that universal classical computation is possible for G = 

55. 

A finite group is nonsolvable if it has a nontrivial subgroup whose commutator sub- 
group is itself. BarringtonjSS] also found evidence for a separation in the computa- 
tional complexity of group multiplication for solvable vs. nonsolvable groups. 
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six pull-through steps and four ancilla pairs. Measuring Gz is the same as mea- 
suring flux, and we have already seen that Gx measurement can be achieved by 
measuring the charge of a pair, specifically, by using a v projectile in a charge 
interferometer. It only remains to verify that we can measure ay. Though ay 
measurement cannot be carried out exactly in this scheme, it turns out that 
a controlled-ay gate can be constructed from 31 pull-through steps, and using 
7 ancilla pairs. Appealing to another trick invented by Kitaev l37l . we can use 
the controlled-CTy gate repeatedly to carry out Uy-measurement to any desired 
accuracy^ Therefore, we have constructed a universal gate set using only the 
Aharonov-Bohm interactions of fluxes and charges; we have a fault-tolerant uni- 
versal quantum computer. 

Unfortunately, the spin model on which this construction is based is not so 
simple. Since the group A 5 has order 60, the Kitaev spin model that realizes 
this scenario has a 60-component spin residing at each lattice link (!) One hopes 
that a simpler implementation of universal Aharonov-Bohm computation will be 
found. 

The fabrication of materials that emulate Kitaev’s spin systems may lie far 
in the future. And even when such materials are available, there will be further 
challenges to the machine designer, such as finding a reliable way to shepherd 
individual quasiparticles along prescribed trajectories. In the nearer term, it is 
interesting to consider whether nontrivial quantum information processing might 
be feasible in existing quantum Hall systems. Furthermore, even if we are unable 
to operate an actual spin system as a quantum computer, a quantum cellular 
automaton that simulates the spin system may provide a promising paradigm 
for fault-tolerant quantum computation. 

6 Is Nature Fault Tolerant? 

The discovery of quantum error correction and fault tolerance has so altered our 
thinking about quantum information that it is appropriate to wonder about the 
potential implications for fundamental physics. And in fact, a fundamental issue 
pertaining to loss of quantum information has puzzled the physics community 
for over twenty years. 

In 1975, Stephen Hawking|2B! argued that quantum information is unavoid- 
ably lost when a black hole forms and then subsequently evaporates completely. 
The essence of the argument is very simple: because of the highly distorted 
causal structure of the black hole spacetime, the emitted radiation is actually 
on the same time slice as the collapsing body that disappeared behind the event 
horizon. If the quantum information that is initially encoded in the collapsing 
body is eventually to re-emerge encoded in the microstate of the emitted infor- 
mation, then that information must be in two places at once. In other words, 
the quantum information must be cloned, a known impossibility under the usual 
assumptions of quantum theory ( 231 ]|. Hawking concludes that not all physical 

® Actually, what we really construct is a controlled-w gate where ui = 

which is also adequate for measurement of ay. 
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processes can be governed by unitary time evolution; the laws of quantum theory 
need revision. 

This argument is persuasive, but many physicists are very distrustful of the 
conclusion. Perhaps one reason for the skepticism is that it seems odd for Na- 
ture to tolerate just a little bit of information loss^J. If processes involving 
black holes can destroy information, then one expects that information loss is 
unsuppressed at the Planck length scale ~ 10“^^ cm, a scale where 

virtual black holes continually arise as quantum fluctuations. It becomes hard to 
understand why quantum information can be so readily destroyed at the Planck 
scale, yet is so well preserved at the much longer distance scales that we have 
been able to explore experimentally — violations of quantum mechanics, after 
all, have never been observed. 

Our newly acquired understanding of fault-tolerant quantum computation 
provides us with a fresh and potentially fruitful way to think about this problem. 
In Kitaev’s spin models, we might imagine that localized processes that destroy 
quantum information are quite common. Yet were we to follow the evolution of 
the system with coarser resolution, tracking only the information encoded in the 
charges of distantly separated quasiparticles, we would observe unitary evolution 
to remarkable accuracy; we would detect no glimmer of the turmoil beneath the 
surface 0 

Likewise, it is tempting to speculate that Nature has woven fault tolerance 
into her design, shielding the quantum noise at the Planck scale from our view. 
The discovery that quantum systems can be stabilized through suitable coding 
methods prompts us to ask the question: Is Nature fault tolerant? If so, then 
quantum mechanics may reign (to excellent accuracy) at intermediate length 
scales, but falter both at the Planck scale (where “errors” are common) and at 
macroscopic scales (where decoherence is rapid). 
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Abstract. We describe the creation of a Greenberger-Horne-Zeilinger 
(GHZ) state of the form (|000) -I- |lll))/\/2 (three maximally entangled 
quantum bits) using Nuclear Magnetic Resonance (NMR). We have suc- 
cessfully carried out the experiment using the proton and carbon spins 
of trichloroethylene, and confirmed the result using state tomography. 
We have thus extended the space of entangled quantum states explored 
systematically to three quantum bits, an essential step for quantum com- 
putation. 



We live in a world which, to the best of our knowledge, is surprisingly well 
described by the laws of quantum mechanics. Fundamental processes in nature 
are all compatible with quantum mechanics, even if some of its predictions are 
counterintuitive. 

A good example of this peculiar behavior is given by the now famous pairs 
of two state systems described by Einstein, Podolosky and Rosen (EPR) P . The 
unorthodox behavior of this composite system has been crystallized by the Bell 
inequalities P), which give a statistical test for the existence of elements of reality 
0. In our world, photons in a polarization state of the form (|00) — |ll))/\/2 
violate these inequalities as shown by Aspect et al.p and therefore contradict 
the existence of elements of reality. 

In a beautiful paper, Greenberger, Horne and Zeilinger (GHZ) demon- 
strated that it is possible, in a single run of experiments using a state of the 
form |000) -I- |111) (the GHZ state), to refute the existence of elements of real- 
ity. The GHZ experiment has been succinctly summarized by Mermin0. Even 
though this experiment is very appealing, nobody has yet been able to perform 
it. The reason is that it is rather difficult to precisely manipulate entangled 
quantum states of many particles. In fact, up to now, entangled pure states of 
only two particles have been systematically explore dmm- 

In this letter we show how three particle entangled states can be realized using 
nuclear magnetic resonance techniques. We have carried out the experiment and 
verified that indeed we had a GHZ state by using tomographyp3|. We first 
describe how it is possible to obtain a pure state result from the initial mixed 
density matrix of a NMR system. Then we explain the sequence of operations 
used to obtain a GHZ state and finally we give the experimental results. 

We will not investigate the non-local behavior of GHZ states, as NMR is not 
appropriate for this undertaking. What we have created, however, is a typical 
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state needed for a three-bit quantum computer. Indeed our approach is the one 
used for quantum computation and we refer the reader to El for an introduction. 

The usual approach for quantum computation requires an initial pure 
state HD . The computation itself is a transformation of this initial state using 
unitary operations. Useful information is then extracted by a measurement of 
the final state. However, it has recently been shown that the computation can 
be performed using an initial mixed state as long as the decoherence time is 
sufficiently long ITiH ,111 411,^1 . 

In liquid state NMR, the computation takes place on a large ensemble of 
identical quantum systems. Each member of the ensemble of quantum systems 
consists of the interacting nuclear spins of a molecule in a high magnetic field. 
The initial state of the nuclear spins is achieved by allowing the system to relax 
to thermal equilibrium. Information processing with such an ensemble can be 
divided into three steps consisting of preparation, computation and readout. 
Each of these steps is equivalent to an application of certain quantum operations 
identically to each member of the ensemble. 

The nuclear spins are manipulated by applying radio frequency (RE) pulses 
tuned to the Larmor frequencies of the spins PI- The spins can be selectively ex- 
cited by exploiting differences in Larmor frequencies. Entanglement and two spin 
operations are achieved by a delay to allow for interaction between spins. In our 
case, these interactions are scalar couplings which can be selectively turned off 
by the use of refocusing pulses tuned to one of the nuclei. Any unitary quantum 
operation can be decomposed into such operations jnEHm. 

The measurement step in NMR consists of observing the signal induced in 
RE coils by the precession of the nuclear spins m- In effect, we measure the time 
evolution of the expectation of the ax and ay operators for each nuclear spin. By 
Fourier transforming the signal and analyzing the spectrum, other operators such 
as tensor products of either ax or ay with / or a^ for pairs of interacting spins 
can be measured. The traceless part of the density matrix (called the deviation 
matrix) can be determined by a tomography procedure |1()| . This involves several 
measurements of spectra after applying different reading pulses to the final state, 
thus permitting observation of all elements of the deviation matrix. 

The main problem in using NMR for the observation of multiparticle entan- 
glements and quantum computation is that the sample is initially in a highly 
mixed state. There are several methods for overcoming this limitation without 
cooling the sample. One idea is to transform the initial mixed state so that we 
have a pseudo-pure state 

Ppp=p|0...0)(0...0| + i^7, (1) 

where N is 2" for n qubits and p -I- (1 — p)/N is the probability of the ground 
state (|0 ...)). In NMR this state is indistinguishable from a pure state because all 
observables are traceless. The methods discussed in the literature [II 21 1 ,31 1 41 1 ,5) 
are all based on the idea of viewing the non-ground state components of the 
initial thermal density matrix as noise and applying an averaging method to 
eliminate their contribution to measurements. These methods are in principle 
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sufficient for exploiting the advantages of quantum computing in an ensemble 
setting. In our experiments we used a new method for extracting pseudo-pure 
states from experimental data. The method is based on two features of our 
experiment. The first is that the sequence of pulses and delays applied to the 
sample for preparing the GHZ state are much shorter than the decoherence 
time for the nuclear spins (about 5ms versus at least 210ms). As a result, the 
evolution is very close to unitary and therefore preserves the eigenvalue structure 
of the initial state. This structure is well known and determined by the thermal 
distribution. The unitarity property can be tested by comparing the eigenvalues 
of the output state to those of the thermal distribution. The second feature is 
that complete state tomography is used to analyze the output of the experiment. 
This allows us to decompose the output state into its eigenstates. The state 
1 000) is associated with the smallest eigenvalue of the thermal state and must be 
transformed by unitary evolution into the corresponding eigenstate of the output. 
By comparing this eigenstate to the desired GHZ state, we learn whether our 
pulses indeed generated this state from that component of the ensemble which 
was initially in |000). 



Cl Cl 




Fig. 1. Pictorial representation of trichloroethylene. The three qubits are given 
by the nuclei of hydrogen (H) and of the two carbon 13’s (Ci and (72). The last 
two are distinguishable because of the assymmetry of the chlorine environment. 
The chlorine nuclei have spin 3/2 and their interaction with the qubits can be 
neglected. 

In order to create a GHZ state we need three nuclei. A convenient system 
is trichloroethylene, the molecule shown in Figure 1. The spins of the hydrogen 
and carbon nuclei were used for the three quantum bits. They interact only 
weakly with the chlorine nuclei, which can therefore be ignored. In a strong 
static magnetic field (11.5 Tesla along the z-axis) the evolution of the hydrogen 
and the two carbon nuclei are well described by the Hamiltonian 

H = -iVHcr^ - wcicrfi - + 

•^CiC2(o-f'crf" -k -k erf 1 erf") -k 



(2) 
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with iOH « 500.1334915 MHz, ujc, « 125.7725805 MHz, ujc^ « 125.7732305 
MHz, which gives a chemical shift of 650 Hz between the carbons. The J cou- 
plings have values of JhCi ~ 203 Hz, JciC 2 ~ 102 Hz and Jhc^ ~ 10 Hz- 

The data was acquired with a Bruker DRX-500 spectrometer using dou- 
bly labeled trichloroethylene (72,99%). The relaxation time (Ti) for the 

hydrogen is 7s and 30s for the carbons. The phase decoherence time (T 2 ) for 
hydrogen is 3s and 0.4s and 0.2s for Ci and C 2 respectively. 
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Fig. 2. a) Circuit to create a GHZ state. Ra{9) corresponds to a rotation of the 
qubit by an angle 6 around the a-axis and the two other gates are CONTROL- 
NOTs (see DiVincenzo[14]). b) Implementation by an NMR pulse sequence pro- 
ducing a GHZ state (in the rotating frame of H and C\). The delay d\ serves 
to let the coupling between the nuclei create the engtanglement, and d ,2 is used 
to generate a phase shift on the C 2 nucleus. Coupling between the carbon nuclei 
during this delay is negligible. All pulses are non-selective. 



A simple circuit to create a GHZ state consists of a rotation by 7t/2 around the 
y-axis followed by two CONTROL-NOT gates on the other qubits. Adapting this 
to our system results in the pulse sequence shown in figure 2 b). The deviation 
density matrix, initially given by the state 

p\ = ujhc^z G) 1 1 + wci 1 G) erf ^ 0 1-1- WC 2 1 G) 1 G) erf^ (3) 

is transformed to 

pf = -WffCr^ (g) crfi (g 1 - LOCi<J^ (g crfi (g 1 

-WC 2 I erf' erf' (4) 

The tomography procedure was implemented with twelve sets of reading 
pulses. For each set, two experiments were performed to read off the hydrogen 
and the carbon spectra. 
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To compute the peak intensities for each spectrum, we assumed that each 
peak is approximately Lorentzian. The peak positions and decay parameters as- 
sociated with the Lorentzian shape were obtained by optimizing (in the least 
squares sense) the match to the calibration spectra. The carbon and hydrogen 
spectra were both matched to within 2%. This should be compared to the esti- 
mated noise, which was determined to be less than .5% for the carbon and .01% 
for the hydrogen nuclei. The mismatch is due to additional peaks, primarily from 
unlabeled compound, and also to shimming problems causing deviation from the 
ideal Lorentzian lineshape. The decay parameters (essentially T|) obtained af- 
ter optimization were 0.51 ± O.OSsec for the hydrogen nucleus, 0.41 ± O.Olsec 
for carbon 1 and 0.23 ± O.Olsec for carbon 2 (the error bars are estimated from 
the variation between different peaks of the same nucleus) . The latter agree well 
with the experimentally determined T 2 time for the carbons. Since T 2 for the 
hydrogen is near 3sec, it can be seen that substantial peak broadening due to 
magnetic field inhomogeneity reduces the hydrogen T^. 

After the peak positions and decay parameters were determined, each spec- 
trum of the experiment was deconvolved as a linear combination of the ideal 
peaks. The coefficients of the combination yield the intensity and phase of each 
peak in the spectrum. These numbers were then normalized by the calibration 
intensities and phase corrected using the calibration phases. Then they were used 
to determine the deviation density matrix of the output state of the experiment. 

An idea of how unitary the evolution has been is obtained by comparing 
the eigenvalues of the initial deviation matrix ({24, 16, 16, 8, —8, —16, —16, —24}) 
to the final one ({24.1,16.3, 15.5,7.7, -8.1, -15.0, -16.5,-23.8}). Clearly the 
transformation was unitary to a very good approximation . 

Using the procedure explained above we get, after diagonalizing and taking 
the appropriate eigenvector, the experimentally determined GHZ state 
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A pictorial representation is given in figure 3. The fidelity of the state compared 
to the ideal GHZ state is 

T = {'I'GHz\pt^^\'l'GHz) = 0.95. 



( 6 ) 
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Fig. 3. Pictorial representations of the theoretical and experimental density 
matrices p for the GHZ state |000) + a), b) Real and imaginary parts of 

the theoretically determined density matrix, c), d) Real and imaginary parts 
of the experimentally measured density matrix. Each bar graph represents the 
value of the transitions between the eight states |000) . . . |111) by the height of 
the bar in the corresponding position of the 8x8 array. 



In conclusion, we have shown how to construct an effective GHZ state with a 
fidelity of 95% in NMR starting with the thermal mixed state. The experiments 
demonstrate the ability in NMR to fully explore the state space of multi-particle 
systems, which is all that is required for quantum computation. By using a four 
spin system, the paradoxical output of the originally proposed GHZ experiments 
can be observed as proposed in m- However, due to the microscopic separation 
of the particles involved and the method for observation used, this would not be 
a true test of the existence of elements of reality. 

To construct our GHZ state we developed a new method for extracting 
pseudo-pure states from NMR spectra. The method can be used efficiently for 
process tomography since each experiment determines the transformation 
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of the state space induced by the applied operation on each of the eigenstates of 
the input state. 

Our experiments demonstrate that room temperature liquid NMR is well 
suited for quantum computations involving small numbers of qubits. Although 
manipulating three qubits is a small step for large scale quantum computation, 
it is the first time that a quantum network has been used to systematically 
entangle more than two qubits. 
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Abstract. We consider the use of Electron-Nuclear Double Resonance 
(ENDOR) techniques in quantum computing. ENDOR resolution as a possible 
limiting factor is discussed. It is found that ENDOR and double-ENDOR 
techniques have sufficient resolution for quantum computing applications. 



1. Introduction 

Recently, one of the authors presented arguments that the Electron Nuclear Double 
Resonance (ENDOR) process may be exploited for the storage and processing of 
discrete data on a quantum scale 1“^. The implication is that a solid state realization 
of a quantum mechanical computer could be engineered. Among the advantages are 
that such a solid state quantum computer would be stable, programmable, and 
input/output (I/O) controllable by current state-of-the-art technology. It could be 
envisioned, in principle, to be “engineerable” for considerably lower per unit cost 
than quantum computers operating on the principle of induced quantum superposition 
and entangled states of trapped ions^, or photon states using microcavities^. 
Eurthermore, solid state ENDOR is a well established procedure (established by 
Eeher^ in 1959), and laboratory components are commercially available at reasonable 
costs. The present paper is a proposed novel quantum computing paradigm based 
upon the use of multipulse resonance techniques to manipulate nuclear spins of a 
mostly relatively low dimensional ensemble deviation from thermal equilibrium^O 
The new paradigm builds upon a previously proposed paradigm which utilizes well 
established techniques from nuclear magnetic resonance (NMR) spectroscopy^^. An 
obvious advantage to using superposition of nuclear spins for quantum logic gates, 
and nuclear spin flips to conduct quantum computing, is the possibility of 
extraordinary long decoherence times due to the relative isolation of nuclear spins 
within a molecule. A disadvantage is related to this, and that is the length of time 
required to couple information in and out of a system and manipulation during 
computation. Another disadvantage is that nuclear spin flips are induced at radio 
frequency (rf) wavelengths, and so quantum computation is restricted to temporal 
unitary evolution and is entirely non-local. 
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In the present paper, we propose a scheme based upon Electron Nuclear Double 
Resonance (ENDOR) as a means to practical quantum computation. The proposed 
scheme builds from the previous proposals which use NMR spectrographic 
techniques^^’ll, but trades reduction in decoherence time by electron spin, nuclear 
spin coupling, but gains in high I/O bit rates and stronger coupling to manipulate 
computation. Also, we shall point out that sequential spatially dependent architecture 
is possible using laser electronic excitation to manipulate electron spin coupling to 
nuclear spins. 

A brief discussion of Electron Spin Resonance (ESR) and ENDOR will be 
presented in the next section. Our novel paradigm for quantum computing using 
laser-induced electronic excitation and ENDOR will be discussed in Section 3, and 
Section 4 will be used for summary and conclusion. 



2. ESR and ENDOR Background 

The theory of ESR is derivable from the Dirac theory and will not be treated here, but 
may be found in the literature^^^’ 12-13. Here, we simply state the essential spin 
Hamiltonian and discuss its interaction terms, 

Ham= pHgS-pj.Kg^l+X'LS + SAl (1) 

The first and second terms correspond to the Zeeman energy contributions due to the 
electron sin, S, and nuclear spin, I, coupling to the magnetic field H. Here, g is the 
coupling tensor in units of the Bohr magneton (3 and is the nuclear spin coupling 
tensor in units of the nuclear magneton, (3jq. The third term in the Hamiltonian is the 
Zeeman interaction associated with the coupling between the magnetic moment due to 
the electron’s intrinsic spin and that due to the electron’s orbital momentum in a 
bound state. This term contains information about electronic defect states with 
different microscopic properties through measurement of shifts in g-value of a 
paramagnetic site. The last term is the hyperfine term which expresses the magnetic 
interaction between the nuclear and electronic spins due to overlap of the electronic 
wave functions with nuclear spins. This term, governed by the interaction tensor A, 
depends upon nuclear spin contact interaction with electronic wave functions and can 
be nearly isotropic, as with nearly s-type electronic orbitals, or anisotropic as for p- or 
d-like orbitals. By analyzing the energy contributions in the hyperfine term, the 
nature of the spin center (type of electronic state) can be determined. 

Electron Nuclear Double Resonance (ENDOR) provides the capability to more 
closely examine the anisotropic hyperfine interaction in terms of the atomic and 
electronic interactions at the paramagnetic centers. Ions or free radicals trapped in a 
solid lattice experience perturbations in their energy levels as expressed by this 
matrix. These perturbations can affect the spin transition dynamics of the 
paramagnetic species, and be detectable by Electron Spin Resonance (ESR). ENDOR 
allows the hyperfine and spin lattice relaxation phenomena to be measured by 
detecting the Nuclear Magnetic Resonance (NMR) signal as a change in the ESR 
spectrum. 

In ENDOR, the nuclear spins are modulated by addition of a transverse rf field, 
while the electron spins are driven by a transverse microwave (MW) field. A 
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simplified version of the ENDOR process can be illustrated by the simplified version 
ofEq. (1), 

Ham=gl5 Hg S^+aS l-gf^Pi^Hg (2) 

where we have neglected spin-orbit coupling, which in many cases is quenched^^. 
The eigenenergies associated with H, Eq. (2), in terms of the appropriate quantum 
numbers are given by 

,nii) = g P nig + a nig m, (3) 

where we assume that the electronic Zeeman energy, given by the first term, and the 
hyperfine energy, the second term in Eq. (3), are much larger than the nuclear Zeeman 
energy. The corresponding energy level structure, together with the ordering of the 
energy levels and transitions in terms of the quantum numbers is illustrated in Eig. 1 . 



H = g p Ho Sz + a S • I - gn Pn Ho Iz 



E( ms, m| ) = g p Ho ms + a ms m| 



mi = + 1/2 




E1 = A 4- 5 



E2 = A - 6 



A = g p Ho 



E3 = -A + 6 5 = a 

E4 = - A - 5 



h(o=E 2 -E 4 = 2A 



Eig. 1 Energy level structure and transitions from Equ. (1). 



Provided 5 *0, transitions 14) ^ |2) and 13) — > II) cause simultaneous electron-nuclear 
double spin flips at the transition energy -co = 2A. Whereas, transitions 14) ^ II) and 
13) ^ 12) correspond to electron spin flips only, but at the transition energy -co = 2A - 1 - 
25. The usual ENDOR procedure requires that the electronic transition be saturated 
using a microwave field at frequency co = 2A / // ; then nuclear spin flips are induced 
with an rf field at frequency Q = 25 / h and appear as modifications in the electron 
spin-resonance spectrum. 

To date, the most prevalent application of the ESR process in solid state materials 
is the determination and characterization of defect structures. As discussed above, the 
hyperfine interaction between the magnetic moments of an unpaired electron and 
neighboring nuclei can yield this information. The hyperfine interaction is sometimes 
not well resolved due to lattice phonon modes. This is especially true of the super- 
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hyperfine interaction or ligand hyperfine interaction that appears in the ESR spectrum 
as an interaction between the magnetic moment of an unpaired spin and its nearest 
neighbor nuclei. To the untrained eye, this broadening effect appears as a 
fundamental resolution limitation of magnetic resonance techniques. However, in the 
ENDOR process, the NMR transitions of neighboring nuclei interacting with the 
unpaired spin are measured by detecting their influence on the unpaired spin’s 
polarization under favorable signal-to-noise experimental conditions (partially 
saturated spin-resonance condition). These ENDOR-detected NMR transitions are 
detected as quantum mechanical transitions of much higher energy than would be 
observed in conventional NMR or ESR techniques. This implies that there are far 
fewer lines in the ENDOR spectrum to resolve than in the conventional ESR or NMR 
spectra. This is shown schematically in Figure 2 for the spin 1/2 system where we 
note that the ENDOR transition (cross transition) is larger than, and fewer in number 
than, either the ESR or NMR transitions. The effect is more noticeable for higher 
order spin systems (/ = 3/2, ..., etc.). This enhancement has been reported 
experimentally in SrFCl and BaFCl systems^^-^^. 

A strong advantage associated with the paradigm presented here is the possibility 
to control the transferred hyperfine interaction, the tensor A in Eq. (1), used as a 
generic example and illustrated in Fig. 3a. Here, an unpaired electron orbital 
associated with atom. A, is represented as an s-state in the electronic ground state, 
without any overlap of the wave function at the nucleus of atom, B. Laser excitation 
of atom, A, on the other hand, induces an electronic transition to a p-orbital or d- 
orbital, with consequent overlap of the electron wave function at the nucleus, B, 
inducing 5)^, electron spin, nuclear spin interaction, as shown in Fig. 3b. If a 
microwave field is tuned to the transition shown in Fig. 1, a simultaneous electron 
spin S^, nuclear spin, Ig, transition, double spin flip is induced. Thus, nuclear spin 
flips can be controlled in atom, B, by electron spin flips controlled by laser-induced 
transferred hyperfine interaction 

This scheme constitutes a significant modification of the NMR quantum 
computation of Refs. 10-11. Here, we use that paradigm to build a controlled NOT- 
gate conditional on a reference nuclear spin, but introduce laser field-induced electron 
spin, nuclear spin coupling by laser-induced transferred Fermi contact interaction. 
Thus, input/output and control can be executed locally under unitary time evolution, 
U = exp(-i Ham At /ft), and the laser field can be used to induce 7t-pulse 

excitation/de-excitation of duration At in sub-picosecond time scale. Thus, deco- 
herence can be minimized and control can be executed on the ultrafast time scale. In 
essence, we trade off some decoherence for fast local control. 

It is also possible to utilize Double-ENDOR (D-ENDOR) effects to increase the 
spectral resolution. In D-ENDOR, two NMR transitions are stimulated while the ESR 
transition is measured. In this case, typical improvements of a factor of 10; 1 in 
resolution are obtained^^. In addition. Optical Detected Magnetic Resonance 
(ODMR) has the potential for increased output resolution. 
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Fig. 2 a) Endor spectra of F(Cl-) and F(F-) centers in BaFCl for BO 150 off c in the 
c-a plane, b) Double ENDOR spectrum for setting fl to an ENDOR line of E(C1) 
centers (see mark on Fig. 2a). c) Double Endor spectrum for setting fl to an ENDOR 
line of F(F-) centers (see mark in Fig. 2a). [From Reference 15, J. R. Niklas, R. U. 
Bauer, and J. M. Spaeth, Phys. Stat. Sol. (b) 119, 171 (1983).] 



3. Quantum Logic Circuits 



Recently, quantum mechanical Hamiltonians for logic gate elements have been 
derived for: NOT^^, exclusive OR^^’^O^ controlled-NOT (C-NOT)^O operations. 
The former papers concerned theoretical derivations while in the latter paper by 
Gershenfeld and Chuang, NMR transitions were experimentally demonstrated to 
realize a C-NOT operation in hardware. This experiment, when taken along with 
earlier workl"^’ 18-20^ leads us to propose an ENDOR-based process paradigm for 
realization of higher complexity quantum logic gates. 




Quantum Computing Using Electron-Nuclear Double Resonances 



369 




Fig. 3a Electronic S - state; A=0, ground state, no overlap. 




Consider a spin system consisting of a “free” spin (electron or hole) in some 
photoactive crystal or polymer, its nearest neighbor nucleus B and its second nearest 
neighbor nucleus C, as illustrated in Figure 4. The magnetic moments of B and C are 
assumed distinct. Spin-spin interaction between 5^ and Ij^ and and Iq is defined 
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Fig. 4 Double ENDOR system; see text. 



by the anisotropic hyperfine tensor A [Eq. (1)]. Stimulating nuclei B and C by 
separate rf wavelengths Vq and V£ corresponding to NMR transition frequencies 
while simultaneously stimulating by microwave radiation sets up a D-ENDOR 
system. Stimulation of 5^ by external laser radiation, Vj_^, serves as a control to flip 
the spin states of the D-ENDOR system. Alternately, Ij^ or Iq could be perturbed by 
IR lasers, but for simplicity, we assume only 5^ laser stimulation. This controlled 
stimulation thus realizes a controlled NOT gate. Consider the spin interaction to he 
coupled to a chain of atoms {D, E, N). Repeated pulsing of 5^ would cause the 

quantum chain to respond as the quantum mechanical analog of a chain of emitters as 
shown in Figure 5. If A is even, then there is no net change in spin state at output, or 
lup) — > lup). If A is odd, lup) ^ Idown). 






1 ^^ — < 



READ 

OUT ODMR 



Fig. 5 Spin coupled to a chain of atoms; N even, no change at output, N odd, spin flip 
at output. Constitutes a controlled NOT gate. 
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4. Conclusion 

We have proposed a new paradigm for quantum computing which begins with the 
construction of a quantum controlled- NOT gate as prescribed in Reference 10, which 
uses a non-equilibrium ensemble of nuclear spins. The experiments of Gershenfeld, 
et al. have demonstrated exceptionally long decoherence times due to the relative 
isolation of nuclear magnetic moments to externally induced transitions. Our scheme 
introduces a higher-order process by coupling electron, nuclear double resonant 
super-hyperfine transitions, controlled by laser-induced electronic transitions. The 
advantages of this scheme are that each complex molecule becomes, in and of itself, a 
quantum computer and the entire system represents massive parallelism. In addition, 
we have shown that laser-induced electronic excitation renders local control of gate 
preparation, spin flips, and input/output which can take place on the sub-picosecond 
time scale. Here, we trade long decoherence times, intrinsic with nuclear spin flips, 
with controlled coupling with electron spin flips by transferred or super-hyperfine 
interactions. The latter is regulated, i.e., on or off, by laser field 7i-pulse electron 
excitation/de-excitation. We hope that this will lead to near-term experimental 
investigations. 
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Abstract. An overview of our work on quantum communication in 
quantum networks is given. We discuss physical implementations for quan- 
tum networks based on cavity-QED, including error correction schemes 
for the transmission of qubits over noisy channels and for noisy local en- 
tanglement and joint measurement operations on atoms. We define the 
’’photonic channel”, and outline the most recent developments concern- 
ing the ” quantum repeater” . 



1 Introduction 

Quantum Communication requires the ability to prepare, manipulate and mea- 
sure single quantum bits. Here we will discuss a quantum-optical implementation 
for Quantum Communication based on elements that have been realized experi- 
mentally, or that can be expected to be realized in the near future: trapped ions 
or atoms, and cavity quantum electrodynamics (CQED) both in the optical and 
microwave regime Q. For instance, among the recent experimental highlights 
in these fields are the first steps towards the implementation of quantum-logical 
operations, such as two-qubit operations that form the building blocks for quan- 
tum computing. Thus, one has performed a universal 2-bit quantum gate, using 
the quantized center-of-mass motion of a trapped ion and its internal quantum 
degrees of freedom ^ . In CQED the basic elements of gate operations involving 
the internal state of an atom and a cavity mode have been demonstrated |3|, 
and recently one has succeeded in entangling internal states of two atoms 0. 

Given this progress we expect to see in the lab within the next few years small 
or mesoscopic quantum (optical) systems consisting of a few particles (quantum 
bits). It is, therefore, important to identify applications of quantum mechanics 
and quantum information which can be realized with finite resources in direct 
relation to these new coming generations of experiments. In the present paper we 
will discuss the possibility of implementing quantum communication in (small) 
quantum networks. Such a quantum network involves nodes in which qubits are 
stored and locally processed. These nodes are connected by quantum wires (such 
as an optical fiber) over which communication takes place by sending photons. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 373-ES3 1999- 
© Springer- Verlag Berlin Heidelberg 1999 
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Such networks are also interesting from the point of view of connecting quantum 
computers and implementing a model of distributed quantum computing. For 
storing and manipulating quantum information atoms are perfect candidates. 
The qubit is stored in two ground states (one or both states could be metastable 
states as well) |0) and |1), so that the quantum memory does not suffer from 
spontaneous emissions. In order to place the atom in a well-defined environment, 
a cavity is used and the atom is strongly coupled to one cavity mode. 

The basic idea for transmitting a qubit is now to manipulate the atom by 
appropriately designed laser fields such that the qubit stored in the atom is 
coherently transferred to the cavity mode. Then the cavity photon will leak out 
of the cavity, travel through vacuum or a fiber to another cavity, enter that 
cavity, and by laser manipulation the information can finally be transferred to 
another atom. In |S| the problem of constructing laser pulses that fulfill this 
goal has been solved. In uni a different idea was proposed, based on adiabatic 
passage, so that the requirement on pulse shapes is relaxed. Crucial here is also 
the possibility of implementing error correction schemes with finite resources. 
Standard error correction schemes and purification protocols work only if the 
number of qubits is large, in principle even infinite. Here we will show it is 
possible to construct different types of error correction protocols, which exploit 
the physical properties of the systems involved. 



2 Photonic Channel 

Specific descriptions of both atoms and cavities have been given in jSj and ini 
Here we give a description of transmission of qubits using photons from a more 
general point of view, which will allow us to optimize encoding qubits in photons. 
We first will define the “photonic” channel and subsequently show how to 
establish a distant EPR pair over this channel. Once one has produced such a 
pair, one can transmit any qubit of information by using teleportation . 

The standard way of encoding qubits in photons is to use two orthogonal 
polarization states, denoted by -I- and — . Since channels are not noiseless, a 
polarized photon may arrive with that polarization, but also may change polar- 
ization, and/or the state may undergo phase shifts. For example, the depolarizing 
channel [H] assumes that with probability F the qubit is left intact and with 
probabilities (1 — F')/3 it undergoes a sign flip, a spin flip, or both. These errors 
are represented by the Pauli spin operators <Jz,x,y acting on the qubit and are 
usually considered the most general type of error occurring. However, photons 
also may be absorbed, and new photons may be created. In order to include the 
latter errors we adopt a general formalism to describe transmitting photon states 
over an arbitrary channel, whose noise can be fully described by its interaction 
with the environment \E). Thus, the most general transformation of input states 
sent over a noisy channel is 

|1+)|0_)|I5) ^ (|l+)|0_)Ti + |0+)|0_)T2 + |1+)|1-)T3 + |2+)|0_)T4 + . . .)!£;), 

|0+)|1_)|I5) ^ (|0+)|1_)T/ + |0+)|0_)T/ + |1+)|1-)T3' + |0+)|2-)T4' + . . .)|i?), 



( 1 ) 
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with \n±) denoting a state with n photons of polarization ±, while all other pho- 
tonic modes are included in|_E). Here the errors are written in terms of operators 
T or T' acting on the environment \E), where all operators are in general differ- 
ent, only restricted by the fact that the total transformation should be unitary. 
In practice, photon absorption is in fact the dominant error over longer distances. 
On the other hand, creation of photons in the optical regime is negligible even 
at room temperature. This indicates that the best way of encoding qubits into 
photons, is to let |0) correspond to sending no photons, with the simple idea 
that if one sends no photons, they cannot be absorbed. On the other hand, |1) 
would then correspond to one photon with particular polarization and frequency 
content. Thus, neglecting the creation of photons and using this encoding, we 
can then write for the general action of a noisy channel 

|0) ^ |0)To, 

|1) ^ |l)Ti + |0)T, (2) 

where we have omitted the initial state of the environment. The operator 7^ 
describes the disappearance of a photon of the chosen polarization, either due 
to photon absorption or due to a polarization change. We emphasize that this 
formulation of encoding and transmission 0 incorporates more physical pro- 
cesses (i.e. is more general) but still is simpler than the usual one using two 
polarizations. 

Having determined from simple physical arguments the best way of encoding 
photons, we now need to include in this description how (optical) photons are 
produced. In general, we can assume that the photon is produced and detected 
by making an atom change its internal state. Again, we wish to describe this 
process in the most general fashion. We consider two atoms A and B belonging 
to Alice and Bob, who have in their labs nodes 1 and 2, respectively. We denote 
by |0) and |1) two internal (ground-state) levels of the atoms, and by \x) any 
other level that may be involved in the process. We assume that the transmission 
process is such that the sending atom will produce a photon only if it started in 
the state |1), whereas no photon is produced if it is in |0). The receiving atom is 
prepared in state |0), and, ideally, is transferred to state |1) upon absorbing this 
photon in the correct mode, whereas if no photon arrives, the atom will remain 
in |0). Of course, errors may occur in this process: when producing a photon 
the first atom may undergo a transition to any other state |a:), and similarly, 
the photon arriving at the second atom may induce a transition to any other 
state. In order to remain in the 2-dimensional Hilbert space of the atoms after 
the transmission we optically pump the sending atom to the state |0) and in the 
receiving atom we pump any state \x yf 0, 1) to the state |0). Thus, the atoms 
undergo the following general process: 

Ixo)|0)a|0)b^ Ixo)|0)a|0)bTo, 

Ixi)|1)^|0)b ^ |xi)|0)a(|1)bTi + |0)bT,). (3) 

Here, the operators To i ^ contain spontaneous emission, photon absorption and 
transitions to other states, followed by repumping to |0). Thus, all complicated 
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physics is hidden in the three operators. We have explicitly excluded the states 
of all the other atoms in the network from the state \E) of the environment, and 
have included them in |xo,i)- In the following we will call the channel defined 
by 0 the “photonic channel” m- It describes in a general fashion the effect on 
atoms that are used to produce and detect photons, which in turn are transmitted 
over a noisy and lossy channel. 

The goal is, using the photonic channel, to establish a perfect EPR pair. Since, 
in classical language, there are nonzero probabilities of errors described by the 
operators az and cr_ = {ax — iay)j2, straightforward application of the standard 
purification schemes would still require in principle infinitely many atoms to 
purify to a perfect EPR pair. Since we are interested in purification using only 
a few atoms, we need a different idea. The protocol designed in HH indicated 
already that the restriction of using few qubits does not in principle prevent 
error correction schemes to be developed. There we exploited a certain property 
of the interaction with the environment (namely, that it is Markovian), but here 
we do not impose such conditions on the errors. The protocol is consequently 
more complicated, and we split the description in two parts. First we show how 
to reduce the action of the channel to that of a simpler one. Then using this 
simpler effective channel, we discuss the actual purification scheme. 

2.1 Channel Rednction 

Here we show how one can effectively eliminate the absorption term with Ta 
from the channel Q)’ by using a few auxiliary atoms in each node, and by 
performing local operations. It is important to note here that we do not require 
any assumption on the operator Ta- 

The goal is first to reduce the channel description to 

Ixo)|0)a|0)b I — > Ixo)|0)a|0)b5o, 

Ixi)|1)a|0)b I — *■ Ixi)|1)a|1)b5i, (4) 

with modified operators S, to be determined below. In fact, we will show that, 
starting from a state (|xo)|0)a + Ixi)|1)a) |0)b we obtain either the state 

Ixo)|0)a|0)b5o + |xi)|1)a|1)b5i, (5) 

or 

(Ixo)|0)a + |xi)|1)a)|0)b5„ (6) 

depending on the result of a set of measurements done on auxiliary atoms. In 
the latter case, we still have the same initial state as the environment factors 
out, so that we may start the procedure again, until the measurements indicate 
we have succeeded in obtaining the state ((HJ. In that way, we effectively have 
reduced the channel to 0). 

The procedure is as follows: we need 2 auxiliary atoms in node 1 , denoted 
by A2 and H3, and one in node 2 , atom i?2- First we entangle atoms A and A2 
(which starts in |0)) according to 
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|0 )a| 0 )a, ^ | 0 )a| 0 )a, 

|1)^|0)^, ^ |1)a|1)a,. (7) 

Then we transmit the qubit A 2 to atom B according to 0. Clearly, atom A 2 
ends up in |0) and plays in fact the role of a dummy qubit. Thus, leaving out 
the state of A 2 , we obtain with this procedure the mapping 

(Ixo)|0)a + |xi)|1)a)|0)b ^ [|xo)| 0)^|0 )bTo + |xi)|l)^((|l)sTi + |0 )bT„)] , 

( 8 ) 

Next we apply a local transformation between A and A^: 

|0)a|0)a3 |0)A|0)yi3 + |1)a|1)^3, 

|1)a|0)a3-|0)a|1)^3 + |1)a|0)a3 (9) 

and proceed as follows: (i): At time t we transmit the qubit A to B, according 
to (jSJ, i.e. by using the dummy atom A 2 . (ii): We interchange |0)a ^ |1)a (Hi)- 
At time t' > t we transmit the qubit A to B 2 according to (01; (iv): We undo 
step (ii), i.e. interchange |0)yi ^ |1)^. After these 4 steps, we obtain the state 

|o)a3Ixo)|o)a ® [\o)B\i)B,m')Toit) + |0)B|0)s,r„(T)ro(t)] 

+ | 0 )^ 3 | xi )| 1 )a 0 [\l)B\0)B,Mt')Ti{t) + \0)B\0)B,Toit')%{t)] 

+ | 1 )^ 3 | xi )| 0 )a 0 [\0)B\l)B,m')To{t) + \0)B\0)B,%{t')Toit)] 

+ |1)^3|Xo)|1)a 0 [\l)B\0)B,To{t')T,{t) + \0)B\0)B,Toit')Ta{t)] (10) 

All the terms containing the photon absorption error operator Ta pertain to the 
combination |0 )b|0)b 2. Hence, a measurement is performed on atoms B and B 2 
to check whether they are in that joint state; (a) If the outcome is negative, the 
parts containing Ta are projected out. We continue then by basically eliminating 
atom B 2 by performing the unitary transformation 

|0)b|1)b2 |0)b|0)b2 |1)b|0)b2 |1)b|0)b2- 

Subsequently we measure the state of A3. If we find |0 )a 3 then we have (0 with 
5o = Ti{t')To{t) and 5i = To{t')T{t), and if we find 11)213, have to interchange 
|0 )a ^ |1)a, and thus obtain 0 with the definitions of 5 q,i interchanged, (b) 
If the outcome is positive, an error has occurred and we have projected onto 
the terms containing Ta- We measure the state of A in the 0, 1 basis, and then 
swap the state of A3 into A. If the outcome was lO)^, then one has 0 with 
Sa = Ta{t')To{t), and if we found ll)^ we would instead have Sa = To{t')Ta{t), 
after interchanging lO)^ <-> |l). This completes the proof: either we have case (a) 
corresponding to 0, or case (b) and ©. 

2.2 Purification Protocol 

If photon absorption can be modeled by a Markovian master equation and if 
other (design) errors are assumed to be systematic (i.e. the same in two sub- 
sequent transmissions) we in fact have a “stationary channel!' which is defined 
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by the condition 7i(t')7o(t)|i?) = TQ{t')Ti{t)\E) . In this case the channel ® 
would allow for ideal transmission without further actions, as Sq = S\. This is 
the situation discussed in Q. However, now we are interested in the general 
case where the stationarity property does not apply. In particular, this will be 
the case where there are additional random errors and when a Markovian de- 
scription of decoherence is questionable. In the following we will show how to 
establish distant EPR pairs using the channel i). The only assumption we will 
use, is that Sq is in some sense ’’close” to 5i, rather than equal. In words, we 
only assume that the “random” part of the errors is smaller than the systematic 
part (the latter including the no-error part). 

Alice and Bob wish to obtain a perfect EPR pair of the form 

= |0).i|0)b+|1)a|1)b (11) 

which we denoted by the “right” state \R)ab, and where we omit trivial normal- 
ization factors. In the following Alice and Bob will perform a specific protocol 
several times, which is such that the state of particles A and B is actually a 
superposition of two Bell states, the second one denoted by “wrong” 

\W)ab = \0)a\1)b + \1)a\0)b ( 12 ) 

For instance, a single use of the channel 0) will produce such a state: For 
convenience, we define the dual basis states |±) = |0) ± |1). Alice prepares her 
qubit A in the state |-|-)^ and Bob prepares his qubit B in |0)s. They use the 
channel and subsequently both of them apply the local operation |0) i-^- |-|-), 
and |1) 1 -^- |— ). This way they obtain the state 

m = \R)abI(So+Si)\E) + \W)abI(So - Si)\E). (13) 

After this, Alice and Bob will repeat the protocol described below, and in terms 
of the state of the environment, at the iVth step they will have a state of the 
form 

= \R)AB\E^r) + \W)ab\e[^^), ( 14 ) 

where are unnormalized states of the environment. 

In order to characterize the quality of the state we define its fidelity 
as Ftv = II The goal is to increase the fidelity to unity, so that for 

large N the state of the system will tend to |i?). For the following, two auxiliary 
atoms Aland Bi are needed, in the first and second cavity, respectively, (i) 
The auxiliary qubit Ai is locally entangled with the qubit A according to the 
transformation 



|0).4|0)ai |0)a|+)ai; |i)a|0)ai |i)a|— )ai- (15) 

(a) The qubit Ai is transmitted to the auxiliary qubit Bi according to the 
effective channel ®. Then, the qubit Ai is measured in the |±)ai basis. If the 
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result is |— )ad one applies the unitary operation |l)si — |1 )bj. After this, 
one applies a controlled rotation 

\x)b\v)b^^ {-^T^\x)B\y)B^ x,y = 0,l. (16) 

The state after these operations is 

= |i?)As(|0)B,5o + \1)b,Si)\E^j^^) + \W)ABmB^So - 

( 17 ) 

(in) The auxiliary qubit B\ is measured. If one finds B\ in the |+) state, one 
gets 

|7^1f^'^) = ^(‘5o + 5i)|ifW), (18) 

|7^!^+'^) = ^(5o-5i)|aW), (19) 

In this case, the “right” state is multiplied by the operator 5 q + 5i, which is a 
“larger” operator than the Sq — S± combination, which multiplies the “wrong” 
state. Thus, in this case the fidelity increases, in a manner specified below. If, 
on the other hand, the result of the measurement on Bi is |— ), one has 

= (20) 

|7^!^^'^) = ^(5o + 5i)|£;W), (21) 

which implies that the fidelity has decreased. We denote by P± the probability 
of these two outcomes. 



2.3 Analysis of Protocol 



We analyze how the fidelity changes after each step, for which we need to evaluate 
P± . To this end we define for arbitrary states of the environment the quantities 



7 T± = 



||i(5o±5i)|A)|p 

ll|7?)IP 



( 22 ) 



To calculate tt± one in principle would need to know the specific form of the 
operators and states at all times. We assume for simplicity, however, that the 
TT± do not depend on \E). Moreover, the only assumption on the operators 5 q±5i 
is that 7T+ > 7T_. Using the definition ()22l we have then 



P± = T^±En + 7 Tzp(l — En), 



(23) 



which implies that P+ > P_. Hence, the probability that the fidelity increases 
is larger than the probability that E decreases. In fact, corresponding to the 
measurement |±) the new fidelity is 



Fn+1 



t^±En 

t^±Fn + 7 T=p(l - Em) ’ 



(24) 
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respectively. For 7r+ > 7 t_, the outcome \+)bi increases the fidelity and occurs 
with a higher probability. Since the decrease in fidelity after a |— )bj^ measurement 
is compensated for by a subsequent \+)bi measurement, the protocol consists of 
a random walk along a set of particular values of F, where it is more likely to 
go up than to go down, thus achieving f | 1 asymptotically. The effectiveness 
of the process depends only on the value of 7r+ which characterizes the effective 
channel. We have simulated the improvement of the fidelity for several values of 
the probability 7 t+. For example, we obtain F = 1 — 10“^ after = 12 steps for 
7T+ = 0.9, which indicates that the scheme is quite effective. In fact, unit fidelity 
is approached exponentially fast with the number of steps. This can be shown 
analytically as follows: First, we note that, if the number of upward steps in the 
random walk in excess of downward steps equals N\, then the fidelity is 



F(iVi) 



iVi 




(25) 



so that, conversely, in order to reach a final fidelity Foo 
approximately 



Ni{S) 



log (w^) 

log(7r_/7r+) 



1 — (5 one needs 



(26) 



steps. Second, since not every measurement will have the outcome +,we also 
need the number of steps N 2 needed to reach Ni steps up in excess of steps 
down. This is approximated by the average of Ni/{P+ — P-), 



N2 




dF 



Ni 

S{2F-1) 




1-25 

S 



(27) 



where S = 7r+ — 7r_ , and where we used that the initial value of F equals 7r+ . 
The total number of steps required to reach a fidelity 1 — 5 is, therefore. 



N 2 



1 



log 



1-25 



log(l^) 



_ log(l/5)log(l/J) 
S log(7r_/7r+) 25'log(7T+/7r_) 



(<5 ^ 0) (28) 



which for small 5 behaves as log(l/5), so that conversely the fidelity grows ex- 
ponentially fast to 1 with the number of steps taken. 

Finally, let us note that for a “good” standard channel [see Eq. ®], To is close 
to T). As a consequence of our reduction procedure this implies that Sq ~ 
and therefore 7r+ ~ 1, and 7 t_ 1. We emphasize that the reverse statement 

is not true, namely one can have Sq ~ 5i but a “bad” standard channel. This 
shows our procedure introduces a new element, allowing one in certain cases to 
develop improved error correction schemes. 



3 Further Developments 

So far we tacitly assumed all local operations to be perfect, which is the stan- 
dard assumption for designing purification protocols. The reason is that nonlocal 
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operations are, of course, much more difficult to control than local operations. 
However, in practice also 2-bit local operations are not at all easy to perform. 
For instance, only very recently has one succeeded in entangling two atoms 0. 
Since on the other hand, 1-bit operations can be considered routine, an im- 
portant question to answer is, whether one can ’’purify” 2-bit local operations 
provided 1-bit operations are perfect. This question we have answered recently 
in where it was shown that, e.g., the joint measurement operation needed 
for the purification protocol discussed above can actually be purified. A crucial 
property is that this measurement can be performed ’’fault-tolerantly”, in the 
sense that an error occurring during the measurement is always detected by the 
measurement itself. Moreover, with a more complicated procedure, also a uni- 
versal 2-bit gated can be performed perfectly in the same sense that errors may 
occur, but will always be detected. 

A second avenue of further research is the following: the error correction 
protocol presented above works to all orders in the photon absorption error. 
The larger the probability for this error to occur is, the more often one has 
to go through all the steps of the protocol. Since the error probability scales 
exponentially with the fiber length L, the number of tries in the protocol, and 
hence the time required for sending a qubit, increases exponentially with L as 
well. In order to avoid this, we can divide the fiber into N segments and place 
’’quantum repeaters” at each segment, just as in classical communication. An 
analysis of this problem P) shows that the fidelity of EPR pairs thus obtained 
decreases exponentially with the number of segments. This drawback, however, 
is not very serious, as above-described error correction scheme can increase the 
fidelity exponentially fast with the number of steps, as just shown. On the other 
hand, including now also local errors, this is no longer true, and another higher- 
level purification scheme is needed. This is reminiscent of concatenated quantum 
codes for fault-tolerant quantum computation H3|. A main difference is that in 
the present case it is not required to be able to protect arbitrary states sent 
over a channel, as we only attempt to establish an EPR pair. As a result, the 
threshold for tolerable errors in local operations is a few percent, rather than 

10 - 5 . 
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1 Introduction 

Any realistic approach to quantum computing must be capable of implement- 
ing a sufficiently large number of qubits to perform useful computations. It has 
been estimated that 10^ qubits may be required to implement Shor’s factor- 
ing algorithm for integers of useful size, and that this number may grow to 
10® qubits when the redundant bits required for quantum error correction are 
included. Many of the quantum computer implementations currently being in- 
vestigated, such as ion traps or NMR techniques, provide a reasonably straight- 
forward method for implementing a few qubits but are subject to fundamental 
limitations on the number of qubits that can be implemented P. P 

The long-term goal of our work is to develop an optical approach to quan- 
tum computing in which the logic gates and memory devices have a sufficiently 
simple structure that large numbers of devices could be mass-produced at a 
reasonable cost. This is a modular approach in which independent logic gates 
could be connected using optical fibers or waveguides. The intrinsic decoherence 
of the devices is expected to be sufficiently small that quantum error correction 
techniques may eventually be able to compensate for them. In that case, the 
modular nature of the approach and the simple structure of the devices may 
allow the implementation of a full-scale computer. 

One of the main difficulties in any optical approach to quantum comput- 
ing is the small magnitude of the nonlinear interaction between two photons, 
which is required for the implementation of quantum logic gates. Other research 
groups |3j have obtained nonlinear phase shifts at the two-photon level by con- 
fining the photons to high-Q cavities and using atomic beams or traps, but that 
approach may not be feasible for the construction of full-scale computers due 
to the size and complexity of the cavities and traps. Our approach relies on 
a newly-predicted mechanism ^ for the production of nonlinear phase shifts 
that is non-classical and exists only at the quantum level. This mechanism is 
expected to give nonlinear phase shifts that are orders of magnitude larger than 
those obtained previously, which should eliminate the need for resonant cavities 
and atomic beams or traps. 

2 Nonlocal Enhancement of Nonlinear Phase Shifts 

Nonlinear phase shifts at the two-photon level can be used to implement quantum 
logic gates, as will be discussed in more detail below. Previous mechanisms for the 
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production of nonlinear phase shifts give a phase shift that is proportional to the 
number Na of atoms in the medium, since both photons must interact with the 
same atom. Our new mechanism involves the interaction of photons with pairs of 
atoms in the medium. Since the number of pairs of atoms is proportional to N\, 
this gives a nonlinear phase shift proportional to iV^ and a large enhancement 
of the phase shift for large values of Na- 

The new mechanism for the production of nonlinear phase shifts is illus- 
trated in Figure 1. Two photons with frequencies u>i and u >2 propagate through 
a medium and interact with two atoms, such as those labeled A and B in the 
figure. As illustrated in the Feynman-like diagram of Figure 2, atom A absorbs 
photon 1 and re-emits photon 2, while atom B absorbs photon 2 and re-emits 
photon 1 . This exchange of the two photons has no net effect other than to pro- 
duce an energy shift AE and a corresponding phase shift that can be calculated 
using fourth-order perturbation theory. Each pair of atoms corresponds to a dis- 
tinct Feynman diagram, so that the sum over intermediate states gives a factor 
of Since Na can be on the order of 10^^, this interaction is expected to give 
phase shifts that are many orders of magnitude larger than those obtained using 
earlier methods. 
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Fig. 1. Nonlinear phase shift that occurs when two photons pass through a 
medium containing Na atoms, two of which are labeled A and B. 



The total contribution from all Feynman diagrams of this kind gives a net 
phase shift proportional to 



^M‘^N\nin2fR 

55 JsTW- 



( 1 ) 



Here AE is the energy shift calculated using perturbation theory, (5i and S 2 
are the photon detunings (difference in photon energy from the atomic excitation 
energy), M is the atomic matrix element, rii and U 2 are the number of photons 
in each beam, w is the line width due to collisions, and fa is a dimensionless 
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factor on the order of unity that is associated with atomic recoil . Under the 
appropriate experimental conditions, it can be shown that the nonlinear phase 
shift is approximately related to the usual, linear phase shift (index of refraction) 

by 



^<Pnon ~ ( 2 ) 

It is relatively easy to obtain linear phase shifts of tt radians with negligible 
decoherence due to absorption and scattering, in which case the nonlinear phase 
shift should also be on the order of unity with very small decoherence. 




Time 



Fig. 2. Typical Feynman- like diagram in which the exchange of two photons by 
a pair of atoms gives a nonlinear phase shift proportional to N\. 



Since the two photons have different energies, the process shown in Figure 2 
does not conserve energy at the location of atom A nor at the location of atom B, 
but it does conserve energy overall. Roughly speaking, each atom must “know” of 
the presence of the other atom in order to conserve the overall energy at the end 
of the process, which suggests that this is a nonlocal process somewhat similar 
to the EPR paradox. It can be shown that this effect cannot be derived using 
the nonlinear susceptibility coefficients commonly employed in nonlinear optics. 
In addition, phase shifts of this kind only occur for states with a well-defined 
number of photons and no such effect is expected to occur for classical beams of 
light. 

3 Preliminary Experimental Results 

A preliminary series of experiments have been performed in a sodium vapor cell 
using the apparatus outlined in Figure 3. Without describing any of the details, 
the experiment was configured in such a way that the nonlinear phase shifts 
described in section 2 would cause the presence of photon 1 to rotate the linear 
polarization of photon 2 if both photons passed through the sodium vapor cell at 
the same time. A polarization analyzer oriented at 90° to the original polarization 
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of photon 2 was placed in its path after it had passed through the sodium vapor 
cell. If photon 1 passed through the cell at the same time, the rotation of the 
polarization of photon 2 would allow it to pass through the analyzer and be 
detected. If the two photons passed through the cell at different times, no such 
polarization rotation would occur and photon 2 would not be detected (for a 
perfect analyzer). Thus the presence or absence of photon 1 could control the 
path taken by photon 2, which corresponds to an elementary logic switch. 




Fig. 3. Simplified design of the experimental apparatus. 



An effect of this kind can be seen in the preliminary experimental results of 
Figure 4, which is a plot of the rate at which both photons were detected as 
a function of the difference in their arrival times. The peak at zero time delay 
suggests that photon 1 can control the polarization of photon 2, as expected. 
This peak is significant at the three-standard-deviation level, based on a careful 
Monte Carlo simulation of the data collection process and the observed data. 
The probability that such a peak is a statistical anomaly is approximately 10“^. 
Smaller peaks were observed at lower sodium densities as expected. 

Dispersion relations based on causality provide a connection between the real 
part of the index of refraction (phase shifts) and the imaginary part (absorption 
or scattering) . The imaginary part of the index of refraction was also measured 
by removing the polarization analyzer and focusing the photon 2 beam near the 
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edge of the corresponding detector, so that any scattering would be expected 
to produce a dip in the coincidence curve rather than a peak. The results of 
measurements of that kind show a dip in the coincidence curve at zero time 
delay, as expected. 
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Fig. 4. Preliminary results of nonlinear phase-shift measurements. 



At the time of this conference, similar effects have been seen in a number 
of sodium vapor cells. These effects vanish at low sodium vapor pressures and 
in the absence of any buffer gas, as would be expected from the theory. We are 
now preparing to make a series of systematic measurements in order to verify 
the dependence of the phase shift on This will require a new design for the 
sodium vapor cell and the use of higher speed electronics. Although systematic 
measurements of this kind are required in order to conclusively demonstrate the 
existence of the predicted effect, the preliminary results do suggest that nonlinear 
phase shifts can be obtained at the two-photon level without the use of resonant 
cavities. 

Similar effects are expected in crystalline materials, which provide higher 
atomic densities and long-term stability. The use of solid-state materials will 
probably be necessary for the construction of useful quantum logic gates. An 
investigation of these nonlinear phase shifts in various crystalline materials is 
planned. 
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4 Quantum Computer Implementation 

We have also investigated the implementation of a quantum computer using 
these nonlinear effects in some detail. The implementation of a Controlled-NOT 
gate is shown in Figure 5. Here the nonlinear phase shift is employed in an 
interferometer arrangement to allow photon 1 to control the output path taken 
by photon 2 |5|. Additional nonlinear phase shifts have been included to provide 
the conventional overall phase shifts for each of the possible input states. 
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Fig. 5. Implementation of a Controlled-NOT gate giving the conventional phase 
shifts. 



The new mechanism for nonlinear phase shifts requires that the two photons 
have different frequencies u>\ and 0 J 2 - The implementation of a practical quantum 
computer would, instead, require logic gates acting on two photons of the same 
frequency. This can be accomplished by introducing a scratch bit corresponding 
to a photon at a different frequency, as illustrated in Figure 6. The scratch bit 
is returned to its original state at the end of the operation, so that “garbage” 
bits do not accumulate. 
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Fig. 6. A Controlled-NOT gate acting on two photons A and B with the same 
frequency. A scratch qubit at a different frequency is used but returned to its 
initial state of 0. 
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In this approach, the memory registers would be implemented by switching 
a single photon into one of two optical-fiber loops, as illustrated in Figure 7. We 
have estimated that several million logic operations could be performed during 
the time that a photon could be stored in this way. Quantum error correction 
techniques could then be used to extend the effective storage time to much 
larger values. Several techniques for compensating for the effects of dispersion 
are currently under consideration. 
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Fig. 7. Proposed memory storage device consisting of two loops of optical fiber 
with electro-optical switches. 



5 Summary 

An optical approach to quantum computing is expected to have many potential 
advantages, including the ability to connect independent logic gates with optical 
fibers or waveguides. We are investigating a new mechanism for the production 
of the required nonlinear phase shifts at the two-photons level, which is based 
on nonlocal interactions between pairs of atoms in a medium. A preliminary 
series of experiments provides some evidence for the existence of the predicted 
nonlinear phase shifts. The implementation of a quantum computer using this 
mechanism has been considered in some detail. 

This approach relies on new physics that has not been thoroughly investi- 
gated, and unexpected difficulties may be encountered. Nevertheless, the modu- 
lar nature of the approach, the simplicity of the logic devices, and its relatively 
low rate of decoherence may eventually allow the construction of full-scale quan- 
tum computers. 
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Abstract. We present a constructive method to translate small quan- 
tum circuits into their optical analogues, using linear components of 
present-day quantum optics technology only. These optical circuits per- 
form precisely the computation that the quantum circuits are designed 
for, and can thus be used to test the performance of quantum algorithms. 
The method relies on the representation of several quantum bits by a 
single photon, and on the implementation of universal quantum gates 
using simple optical components (beam splitters, phase shifters, etc.). 
The optical implementation of Brassard et al.’s teleportation circuit, a 
non-trivial 3-bit quantum computation, is presented as an illustration. 



1 Introduction 

The promise of ultrafast computation using quantum mechanical logic raised by 
Shor’s discovery of a polynomial algorithm for factoring PJ has yet to materialize 
in physical implementations. While quantum logic has been implemented in a 
number of different guises m, the dynamics and behavior of a quantum circuit 
subject to noise and quantum decoherence has only been tested in simulations 
on a classical computer |4lb) (but see jOj). There is little controversy about the 
realization that it is the quantum mechanical superposition principle, and the 
entangled, nonlocal, states it engenders, that are at the origin of the speed-up of 
quantum algorithms with respect to their classical counterparts. Still, effective 
quantum algorithms are few and far between, and even those that are known 
today have yet to be tested in a physical realization (but see 0-) 

In anticipation of physical realizations that implement quantum superposi- 
tions between physical states, we present here a method of constructing circuits 
based on non-local superpositions of “eventualities”, rather than physical ob- 
jects. More precisely, we simulate quantum superpositions, “qubits”, as “which- 
path” eventualities in linear optics, implementable on standard optical benches. 
While the “support” of these qubits is decidedly classical (the optical devices 
such as beam splitters, polarizers, etc.) the wave function at the exit of the 
optical circuit can be made to coincide arbitrarily well with the outcome of the 
anticipated computation, thus implementing the quantum circuit. Naturally, this 
“classical” implementation of quantum logic has its drawbacks, as we comment 
on further below. Still, it should provide an excellent (and cost effective) means 
for testing small circuits for quantum error correction or quantum algorithms. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. .891 - BTm 1999. 

© Springer- Verlag Berlin Heidelberg 1999 



392 



C. Adami and N.J. Cerf 



As we point out below, the realization that optical which-path eventualities 
of single photons can simulate qubits is not new in itself. Here, we focus on 
protocols to translate any quantum circuit diagram into linear optics networks, 
which puts the realization of simple circuits decidedly within reach. Quantum 
computation can be described as the task of performing a specific unitary trans- 
formation on a set of quantum bits (qubits) followed by measurement, so that 
the outcome of the measurement provides the result of the computation. This 
unitary transformation can be constructed with a finite number of 4 x 4 unitary 
matrices, that is, using a quantum circuit utilizing only 1-bit and 2-bit quantum 
gates (see, e.^., I8I91 1 . The universality of 1- and 2-bit gates in the realization of 
an arbitrary quantum computation was shown in m- Furthermore, it was real- 
ized recently that an optical realization exists for any N x N unitary matrix m, 
a result which generalizes the well-known implementation of U (2) matrices us- 
ing a lossless beam splitter and a phase shifter (see, e.g., ^2]). Accordingly, 
each element of U{N) can be constructed using an array of 0{N^) beam split- 
ters that form an optical multiport with N input and N output beams. As we 
shall see below, this result together with the universality of (1- and 2-bit) gates, 
can be exploited constructively, providing a systematic method for assembling 
optically-simulated gates to build simple quantum circuits. 

2 Logical Qubits in Optics 

Let us start by considering the equivalence between traditional linear optics 
elements (such as beam splitters or phase shifters) and 1-bit quantum gates 
(see, e.g., [1 ,3j 1 . This equivalence is inspired by the standard two-slit experiment 
of quantum mechanics, in which a single quantum can interfere with itself to 
produce fringes on a screen. Accordingly, a quantum on the other side of the slit 
is in a superposition of paths, and the quantum mechanical uncertainty principle 
is in full effect with respect to location and phase H3I. 

For example, in quantum circuit terminology, an optical symmetric beam 
splitter is known to act as a quantum vNOT gate (up to a phase of tt/ 4) if we 
use the pair of input modes |01) (or |10)) to represent the logical 0 (or 1) state 
of the qubit. If one input port is in the vacuum state |0) (absence of a photon) 
and the second one is in a single-photon state |1), the output ports will then be 
in a superposition state |01) -I- i|10). Thus, with the identification of the logieal 
qubits |0l) = |01) and |1l) = |10), we produce the wavefunction 

|^) = ^(|0i) + *|U)) , (1) 

from the initial state |0l) just by running a photon through a beam splitter. 
(The factor i arises from the tt/ 2 phase shift between the transmitted and the 
reflected wave in a lossless symmetric beam splitter [?].) 

Similarly, a quantum phase gate can be obtained by use of a phase shifter 
acting on one mode of the photon. In other words, single-photon interferometry 
experiments can be interpreted in quantum circuit language, the “which-path” 
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Fig. 1. Example of optical simulation of basic quantum logic gates, (a) 
Hadamard gate on a “location” qubit, using a lossless symmetric beam split- 
ter. (b) Controlled-NOT gate using a polarization rotator. The location and 
polarization are the control and target qubit, respectively, (c) Same as (b) but 
the control and target qubits are interchanged by the use of a polarizing beam 
splitter. 
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variable being substituted with a quantum bit. Although a general proof for the 
existence of an optical realization of an arbitrary quantum circuit is implicitly 
given in Ref. CH, the simple duality between quantum logic and single-photon 
optical experiments has not been exploited. Here (and in m) it is shown that a 
single-photon representation of several qubits can be used to exploit this duality: 
as several (say n) quantum bits can be represented by a single photon in an inter- 
ferometric setup involving essentially 2" paths, quantum conditional dynamics 
can easily be implemented by using different optical elements in distinct paths. 
The appropriate cascading of beam splitters and other linear optical devices en- 
tails the possibility of simulating networks of 1- and 2-bit quantum gates (such 
as the Hadamard or the controlled-NOT gate, see Fig. CJ, and thereby in prin- 
ciple achieving universal n-bit quantum computations P). This is in contrast 
with traditional optical models of quantum logic, where in general n photons 
interacting through nonlinear devices (acting as 2-bit quantum gates) are re- 
quired to represent n qubits (see, e.g., Pj). Such models typically make use of 
the Kerr nonlinearity to produce intensity-dependent phase shifts, so that the 
presence of a photon in one path induces a phase shift to a second photon (see, 
e.g., the optical realization of a Fredkin gate HZI)- Instead, the method proposed 
here yields a straightforward method for “translating” any n-bit quantum circuit 
into a single-photon optical setup, whenever n is not too large. The price to pay 
is the exponential growth of the number of optical paths, and, consequently, of 
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Fig. 2. Implementation of two sequential Hadamard transformations as a bal- 
anced Mach-Zehnder interferometer using lossless symmetric beam splitters only. 





optical devices that are required. This will most likely limit the applicability of 
the proposed technique to the implementation of relatively simple circuits. 

First, let us consider a single-photon experiment with a Mach-Zehnder in- 
terferometer in order to illustrate the optical simulation of elementary quantum 
gates (see Fig. . One qubit is involved in the description of the interferometer in 
terms of a quantum circuit: the “location” qubit, characterizing the information 
about “which path” is taken by the photon. Rather than using the occupation 
number representation for the photon, here we label the two input modes enter- 
ing the beam splitter by |0) and |1) (“mode description” representation). The 
quantum state of the photon exiting the beam splitter then is |0') -I- i|l') or 
|1') -I- j|0') depending on the input mode of the photon. This is the -\/NOT gate 
discussed earlier. Placing phase shifters at the input and output ports as shown 
in Fig. the beam splitter can be shown to perform a Hadamard transforma- 
tion between input and output modes, i. e., 




1 

71 





( 2 ) 



In this sense, a lossless symmetric beam splitter (supplemented with two —tt/2 
phase shifters) can be viewed as a Hadamard gate acting on a location qubit. 
Recombining the two beams using a second beam splitter (see Fig. n in or- 
der to form a balanced Mach-Zehnder interferometer corresponds therefore, in 
this quantum circuit language, to having a second Hadamard gate acting subse- 
quently on the qubilQ Since = 1, it is not a surprise that the location qubit 
returns to the initial basis state (|0) or |1)) after two beam splitters (with the 
appropriate phase shifter). This sequence of two Hadamard gates simply con- 
veys the fact that the contributions of the two paths interfere destructively in 



^ Here and below, it is understood that the path lengths are adjusted so that the 
difference between dynamical phases vanishes. 
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Fig. 3. Implementation of two sequential Hadamard transformations with inter- 
mediate conditional operation on the polarization. This circuit produces an en- 
tangled state |0)poi|0)ioc + |l)poi|l)ioc between the polarization (which is initally 
in a product thate with the location qubit at the input port of the interferom- 
eter) and the location qubit (denoted by |0) and |1) in the figure) just before 
the final beam splitter, preventing the observation of interference fringes at the 
output of this interferometer. 





one of the output ports, so that the photon always leaves the interferometer in 
the other. 

More interestingly, consider now the same interferometer using polarized pho- 
tons (the photon is horizontally polarized at the input). Assuming that none of 
the devices acts on polarization, the photon exits the interferometer with the 
same polarization. In a circuit terminology, this corresponds to introducing a 
“polarization” qubit (|0)poi stands for horizontal polarization) which remains in 
a product state with the location qubit throughout the circuit. If a polarization 
rotator is placed in one of the branches of the interferometer, dipping the polar- 
ization from horizontal |0)poi to vertical |l)poi, it is well known that interference 
disappears since both paths become distinguishable. This corresponds to placing 
a 2-bit controlled-NOT gate (represented in Fig.^) between the two Hadamard 
gates, where the location qubit is the control and polarization is the target bit 
(see Fig.|^. The circuit in Fig. El thus simply implements the dynamics 

|0)poi|0)ioc ^ (|0)poi|0)ioc + |l)poi|l)ioc) (3a) 

^ 2 (|0)pol|0)loc + |l)pol|0)loc + |0)pol|l)loc — |l)pol|l)loc) ■ (3b) 

which “tags” each path with a particular polarization just before the final beam 
splitter in the sense that the polarization of the photon is dipped conditionally 
on its location. The disappearance of interference fringes then simply reflects 
the entanglement between location and polarization qubits (the reduced density 
matrix obtained by tracing over polarization shows that the photon ends up in 
a mixed “location” state, i.e., it has a 50% chance of being detected in one or 
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the other exit port). This suggests that Feynman’s rule of thumb (namely that 
interference and which-path information are complementary) is a manifestation 
of the quantum no-cloning theorem: the location qubit cannot be “cloned” into 
a polarization qubit. However, the fringes can be resurrected via a quantum era- 
sure procedure m (which involves placing polarizing beam-splitters, introduced 
below, at the exit ports of the construction) . 

The optical analogue of other basic quantum gates can be devised following 
the same lines. For example, a polarizing beam splitter achieves a controlled- 
NOT gate where the location qubit is flipped or not (the photon is reflected or 
not) conditionally on its state of polarization, as shown in Fig. Fredkin, Tof- 
foli, as well as controlled-phase gates can easily be simulated in the same manner 
but will not be considered here. The central point is that, in principle, a universal 
quantum computation can be simulated using these optical substitutes for the 
universal quantum gates. The optical setup is constructed straightforwardly by 
inspection of the quantum circuit. A circuit involving n qubits requires in general 
n successive splitting stages of the incoming beam, that is, 2"’ optical paths are 
obtained via 2" — 1 beam splitters. (The use of polarization of the photon as a 
qubit allows using 2"“^ paths only.) This technique is thus limited to the sim- 
ulation of quantum networks involving a relatively small number of qubits (say 
less than 5-6 with present technology). The key idea of a quantum computer, 
however, is to avoid just such an exponential size of the apparatus by having n 
physical qubits performing unitary transformations in a 2"-dimensional space. 
In this respect, it can be argued that an optical setup requiring ^ 2" optical 
elements to perform an n-bit quantum computation represents a classical optical 
computer (see, e.g., 0). Accordingly, the issue of whether non-locality (which 
is at the heart of entanglement) is physically present in the optical realization is 
a matter of debate. 

3 Optical Quantum Teleportation 

As an illustration, we show that a quantum circuit involving 3 qubits and 8 
quantum gates (see Fig. B can be implemented optically using essentially 9 
beam splitters m- This circuil0 has the property that the arbitrary initial state 
1^) of qubit A is “teleported” to the state in which qubit A is left after the 
process. In the original teleportation scheme |^, two classical bits (resulting 
from a Bell measurement) are sent by the emitter, while the receiver performs 
a specific unitary operation on A depending on these two bits. However, it is 
shown in m that these unitary operations can be performed at the quantum 
level as well, by using quantum logic gates and postponing the measurement of 
the two bits to the end of the circuit. The resulting quantum circuit (Fig. E) is 
formally equivalent to the original teleportation scheme (although no classical 
bits are communicated) as exactly the same unitary transformations and quan- 
tum gates are involved. While we do not claim that an optical realization gives 
rise to “genuine” teleportation, this example circuit is instructive to demonstrate 



^ This teleportation circuit is equivalent to the one described in m- 



Quantum Computation with Linear Optics 397 



Fig. 4. Quantum circuit for teleportation (from The initial state of qubit 

A is teleported to the state of qubit A. Qubits a and A must be initially in state 
|0). Qubits A and cr, if measured at the end of the circuit, yield two classical 
(random) bits that are uniformly distributed. 



A 

a 

X 




0,1 

0,1 



the correspondence between quantum logic and optical devices as it is small (3 
qubits) but non-trivial. 

In the optical counterpart of this circuit (see Fig.^, qubits A and A corre- 
spond to the location of the photon at the first and second splitting level, while tr 
stands for the polarization qubit. Note that the photons are initially horizontally 
polarized, i.e., in polarization state |0). The first beam splitter A in Fig.|^acts 
as a Hadamard gate on A, as explained previously. For convenience, we depict 
the teleportation of state \il)) = |0), so that the incident photon enters this beam 
splitter in the input port labeled |0). However, as any operation in 17(2) can be 
realized optically, an arbitrary state of A can be prepared (and then teleported) 
by having an additional beam splitter (with tunable phase shifters) connected 
to both input ports of beam splitter A. The second level of beam splitters B 
(and B’jl corresponds to the Hadamard gate B on A in Fig. 0 The four paths 
at this point (AA = 00, 01, 10, and 11) label the four components of the state 
vector characterizing qubits A and A. The probability amplitude for observing 
the photon in each of these four paths, given the fact the photon enters the |0) 
port of beam splitters A and B, is then simply the corresponding component 
of the wave vector. The combined action of both controlled-NOT gates C in 
Fig. 0 is to flip the polarization state of the photon (qubit a) conditionally on 
the parity of A-|- A (mod 2), which is achieved by inserting polarization rotators 
C at the appropriate positions. In other words, the polarization is flipped on 
path 01 or 10, while it is unchanged on path 00 or 11. 

The Hadamard gate D in Fig. 0 acts on qubit A, independently of A. This is 
achieved in Fig. El by grouping the paths in pairs with the same value of A (i.e., 
crossing the paths) and using two beam splitters D in order to effect a Hadamard 
transformation on A (one for each value of A). Similarly, the controlled-NOT 
gate E acting on A (conditionally on the polarization) is implemented by the use 

^ For convenience, two realizations (B and B’) of the Hadamard gate are used in Fig.EI 
where B’ is obtained from B by interchanging the |0') and |1') output ports in Fig.^. 
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Fig. 5. Optical realization of the quantum circuit for teleportation using polar- 
ized photons. The location qubit A characterizes the “which-arm” information 
at the first beam splitter, while qubit A stands for the “which-path” informa- 
tion at the second level of splitting. The initial location qubit A is teleported 
to qubit A and probed via the interference pattern observed at the upper or 
lower {A = 0, 1) final beam splitter, for both polarization states (cr = 0, 1) of the 
detected photon. 




of two polarizing beam splitters E after crossing the paths again. A polarizing 
beam splitter leaves a horizontally polarized photon (state |0))) unchanged, while 
vertical polarization (state |1)) is reflected. The last Hadamard gate F in Fig. El 
corresponds to the last two beam splitters F, and the final controlled-NOT gate 
G is simply achieved by crossing the paths (A = 0,1) in the lower arm (T = 1) 
versus the upper arm (A = 0). In fact, the setup could be simplified by noting 
that the conditional crossing of paths achieved by G simply reduces to relabeling 
the output ports of beam splitter F in the A = 1 arm. In Fig. 0 only those phase 
shifters associated with the Hadamard gates (Fig. that are relevant in the 
final detection are indicated. 

The interpretation of this optical circuit in the language of teleportation is 
the following. After being “processed” in this quantum circuit, a photon which 
was initially horizontally polarized can reach one of the two “light” detectors 
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(solid line in Fig. 0 with horizontal or vertical polarization. This corresponds to 
the final measurement of qubits A and a in Fig. 01 yielding two classical (random) 
bits: upper or lower arm, horizontal or vertical polarization. The third qubit, A, 
contains the teleported quantum bit, that is, the initial arbitrary state of A. 
Since the location state of the photon is initially |0) in the setup represented in 
Fig.0 it always exits to the “light” detector and never reaches the “dark” one 
(dashed line). For any measured value of A (photon detected in the upper or 
lower arm) and a (horizontally or vertically polarized photon), the entire setup 
forms a simple balanced Mach-Zehnder interferometer. Indeed, there are exactly 
two indistinguishable paths leading to each of the eight possible outcomes (four 
detectors, two polarizations); these interfere pairwise, just as in a standard Mach- 
Zehnder interferometer, explaining the fact that the photon always reaches the 
“light” detector (in both A = 0 and yl = 1 arms and for both polarizations). 
In this sense, the initial “which-arm” qubit A has been teleported to the final 
“which path” qubit A. Note that, as no photodetection coincidence is required 
in this optical experiment, the setup is actually not limited to single-photon 
interferometry. This largely simplifies the realization of the optical source since 
classical light fields (such as those from a laser) can be used rather than number 
states. 

4 Conclusion 

We have proposed a general technique for simulating small-scale quantum net- 
works using optical setups composed of linear optical elements only. This avoids 
the recourse to non-linear Kerr media to effect quantum conditional dynamics, 
a severe constraint in the usual optical realization of quantum circuits. A draw- 
back of this technique is clearly the exponential increase of the resources (optical 
devices) with the size of the circuit. Nevertheless, as optical components that 
simulate 1- and 2-bit universal quantum gates can be cascaded straightforwardly, 
a non-trivial quantum computing optical device can easily be constructed if the 
number of component qubits is not too large. We believe this technique can 
be applied without fundamental difficulties to the encoding and decoding cir- 
cuits that are involved in the simplest quantum error-correcting schemes 1221 , 
opening up the possibility for an experimental simulation of them. Furthermore, 
this technique promises a technologically simple way to test quantum algorithms 
for performance and error stability. Last but not least, the correspondence be- 
tween quantum circuits and optical (interferometric) setups suggests that new 
and improved interferometers could be designed using the quantum circuit lan- 
guage | 23 ]- 
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Abstract. Photons in cavities have been already used for the realization 
of simple quantnm gates [Q.A. Turchette et al., Phys. Rev. Lett. 75 , 4710 
(1995)]. We present a method for combatting decoherence in this case. 



1 Introduction 

Quantum optics is usually concerned with the generation of nonclassical states 
of the electromagnetic field and their experimental detection. However with the 
recent rapid progress in the theory of quantum information processing the pro- 
tection of quantum states and their quantum dynamics also is becoming a very 
important issue. In fact what makes quantum information processing much more 
attractive than its classical counterpart is its capability of using entangled states 
and of processing generic linear superpositions of input states. The entanglement 
between a pair of systems is capable of connecting two observers separated by a 
space-like interval, it can neither be copied nor eavesdropped on without distur- 
bance, nor can it be used by itself to send a classical message Q. The possibility 
of using linear superposition states has given rise to quantum computation, which 
is essentially equivalent to have massive parallel computation [2j. However all 
these applications crucially rely on the possibility of maintaining quantum co- 
herence, that is, a defined phase relationship between the different components 
of linear superposition states, over long distances and for long times. This means 
that one has to minimize as much as possible the effects of the interaction of the 
quantum system with its environment and, in particular, decoherence, i.e., the 
rapid destruction of the phase relation between two quantum states of a system 
caused by the entanglement of these two states with two different states of the 
environment m- 

Quantum optics is a natural candidate for the experimental implementation 
of quantum information processing systems, thanks to the recent achievements 
in the manipulation of single atoms, ions and single cavity modes. In fact two 
quantum gates have been already demonstrated jSltiJ in quantum optical systems 
and it would be very important to develop strategies capable of controlling the 
decoherence in experimental situations such as those described in Refs. m- 

In this paper we propose a simple physical way to control decoherence and 
protect a given quantum state against the destructive effects of the interaction 
with the environment: applying an appropriate feedback. The feedback scheme 
considered here has a quantum nature, since it is based on the injection of an 
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appropriately prepared atom in the cavity and some preliminary aspects of the 
scheme, and its performance, have been described in Refs. pi8| 

2 A Feedback Loop for Optical Cavities 

Applying a feedback loop to a quantum system means subjecting it to a series 
of measurements and then using the result of these measurements to modify 
the dynamics of the system. Very often the system is continuously monitored 
and the associated feedback scheme provides a continuous control of the quan- 
tum dynamics. An example is the measurement of an optical field mode, such 
as photodetection and homodyne measurements, and for these cases, Wiseman 
and Milburn have developed a quantum theory of continuous feedback [3|. This 
theory has been applied in Refs. m to show that homodyne-mediated feedback 
can be used to slow down the decoherence of a Schrodinger cat state in an optical 
cavity. 

Here we propose a different feedback scheme, based on direct photodection 
rather than homodyne detection. The idea is very simple: whenever the cavity 
looses a photon, a feedback loop supplies the cavity mode with another photon, 
through the injection of an appropriately prepared atom. This kind of feedback 
is naturally suggested by the quantum trajectory picture of a decaying cavity 
field 1 1 I j , in which time evolution is driven by the non-unitary evolution operator 
exp{— 7 ta^a/ 2 } interrupted at random times by an instantaneous jump describ- 
ing the loss of a photon. The proposed feedback almost instantaneously “cures” 
the effect of a quantum jump and is able therefore to minimize the destructive 
effects of dissipation on the quantum state of the cavity mode. 

In more general terms, the application of a feedback loop modifies the master 
equation of the system and therefore it is equivalent to an effective modification 
of the dissipative environment of the cavity field. For example. Ref. shows 
that a squeezed bath uni can be simulated by the application of a feedback 
loop based on a quantum non-demolition (QND) measurement of a quadrature 
of a cavity mode. In other words, feedback is the main tool for realizing, in the 
optical domain, the so called “quantum reservoir engineering” m. 

The master equation for continuous feedback has been derived by Wiseman 
and Milburn [3, and, in the case of perfect detection via a single loss source, is 
given by 

p = -f<P{apa?) - -a?ap - -pa' a , ( 1 ) 

where 7 is the cavity decay rate and is a generic superoperator describing 
the effect of the feedback atom on the cavity state p. Eq. assumes perfect 
detection, i.e., all the photons leaving the cavity are absorbed by a unit-efficiency 
photodetector and trigger the cavity loop. It is practically impossible to realize 
such an ideal situation and therefore it is more realistic to generalize this feedback 
master equation to the situation where only a fraction 77 < 1 of the photons 
leaking out of the cavity is actually detected and switches on the atomic injector. 
It is immediate to see that (Q]) generalizes to 
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p = rj^(l>{apa)) + (1 — p)^apa) — ap — —pa) a 



(2) 



Now, we have to determine the action of the feedback atom on the cavity 
field this atom has to release exactly one photon in the cavity, possibly 

regardless of the field state in the cavity. In the optical domain this could be 
realized using adiabatic transfer of Zeeman coherence nni. 



2.1 Adiabatic Passage in a Three Level A Atom 

A scheme based on the adiabatic passage of an atom with Zeeman substructure 
through overlapping cavity and laser fields has been proposed US! for the gen- 
eration of linear superpositions of Fock states in optical cavities. This technique 
allows for coherent superpositions of atomic ground state Zeeman sublevels to be 
“mapped” directly onto coherent superpositions of cavity-mode number states. 
If one applies this scheme in the simplest case of a three-level A atom one obtains 
just the feedback superoperator we are looking for, that is 

^{p) = a\aa^)-^^^p{aa^)-^^^a , (3) 

corresponding to the feedback atom releasing exactly one photon into the cavity, 
regardless the state of the field. 

To see this, let us consider a three level A atom with two ground states |(;i) 
and 1 ^ 2 ), coupled to the excited state |e) via, respectively, a classical laser field 
f2{t) of frequency lol, and a cavity field mode of frequency u>. The corresponding 
Hamiltonian is 



H{t) = huja'^a + hoJeg\e){e\ - ihg{t) {\e){g 2 \a- | 5 ' 2 )(e|a''') 
+zni2(t)(|e)(gi|e— ‘-|gi)(e|e--‘) . (4) 

The time dependence of fi{t) and git) is provided by the motion of the atom 
across the laser and cavity profiles. This Hamiltonian couples only states within 
the three-dimensional manifold spanned by \g\,n), |e, n), \g 2 , n+ 1), where n de- 
notes a Fock state of the cavity mode. Of particular interest within this manifold 
is the eigenstate corresponding to the adiabatic energy eigenvalue (in the frame 
rotating at the frequency lo) En = nTioj, 



\En{t)) 



g{t)^/n+ l|gi, n) -b f?(t)|g 2 , n+1) 
y/f^^(t) + {n+ l)g^(t) 



(5) 



which does not contain any contribution from the excited state and for this rea- 
son is called the “dark state” . This eigenstate exhibits the following asymptotic 
behavior as a function of time 



f \gi,n) for n{t)/g{t)^0 
ll52,?^+l) for g{t)/n{t)^0 



Now, according to the adiabatic theorem m, when the evolution from time to 
to time ti is sufficiently slow, a system starting from an eigenstate of H{to) will 
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pass into the corresponding eigenstate of H(ti) that derives from it by continuity. 
This means that if the atom crossing is such that adiabaticity is satisfied, when 
the atom enters the interaction region in the ground state the following 
adiabatic transformation of the atom-cavity system state takes place 

\9i){gi\'^^Pn,m\n){m\ (7) 

n,m 

Iff2)(52| <^'^Pn,Tn\n+l){m+ 1| 

n,m 

= \92){g2 \ O . 

Roughly speaking, this transformation amounts to a single photon transfer from 
the classical laser field to the quantized cavity mode realized by the crossing 
atom, provided that a counterintuitive pulse sequence in which the classical laser 
field f2{t) is time-delayed with respect to g{t) is applied. FigureQJshows a simple 
diagram of the feedback scheme, together with the appropriate atomic config- 



I classical laser field 




atom injector 



Fig. 1. Schematic diagram of the photodetection-mediated feedback scheme pro- 
posed for optical cavities, together with the appropriate atomic configuration for 
the adiabatic transfer. 
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uration, cavity and laser field profiles needed for the adiabatic transformation 
considered. 

The quantitative conditions under which adiabaticity is satisfied are obtained 
from the requirement that the transition from the dark state |£’„(t)) to the other 
states be very small. One obtains [T^IT7| 



f2rt 



C5 9n 



^ T 



( 8 ) 



where T^ross is the cavity crossing time and fimax, gmax are the two peak inten- 
sities. 

The above arguments completely neglect dissipative effects due to cavity 
losses and atomic spontaneous emission. For example, cavity dissipation couples 
a given manifold |e,n), \g 2 ,n+ 1) with those with a smaller number of 

photons. Since ideal adiabatic transfer occurs when the passage involves a single 
manifold, optimization is obtained when the photon leakage through the cavity 
is negligible during the atomic crossing, that is 

, ( 9 ) 



where h is mean number of photons in the cavity. On the contrary, the tech- 
nique of adiabatic passage is robust against the effects of spontaneous emission 
as, in principle, the excited atomic state |e) is never populated. Of course, in 
practice some fraction of the population does reach the excited state and hence 
large values of gmax and fimax relative to the spontaneous emission rate 7 e are 
desirable. To summarize, the quantitative conditions for a practical realization 
of the adiabatic transformation 0 are 

^max 1 Qmax > > ’^7,7e , (10) 

which, as pointed out in \m. could be realized in optical cavity QED experi- 
ments. 

We note that when the adiabaticity conditions (jl 1)1) are satisfied, then also 
the Markovian assumptions at the basis of the feedback master equation 0 are 
automatically justified. In fact, the continuous feedback theory of Ref. |2| is a 
Markovian theory derived assuming that the delay time associated to the feed- 
back loop can be neglected with respect to the typical timescale of the cavity 
mode dynamics. In the present scheme the feedback delay time is due to the 
electronic trasmission time of the detection signal and, most importantly, by the 
interaction time Tcross of the atoms with the field, while the typical timescale 
of the cavity field dynamics is l/yh. Therefore, the inequality on the right of 
Eq. II I i )ll is essentially the condition for the validity of the Markovian approxi- 
mation and this a posteriori justifies our use of the Markovian feedback master 
equation 0 from the beginning. 



2.2 Properties of the Adiabatic Transfer Feedback Model 

When we insert the explicit expression (0 of the feedback superoperator into 
Eq. (0, the feedback master equation can be rewritten in the more transparent 
form 
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p = [ 2 apa) - a) ap- 



pa) a) 




( 11 ) 



that is, a standard vacuum bath master equation with effective damping coeffi- 
cient (1 — 77)7 plus an unconventional phase diffusion term, in which the photon 
number operator is replaced by its square root and which can be called “square 
root of phase diffusion” . 

In the ideal case 77 = 1, vacuum damping vanishes and only the unconven- 
tional phase diffusion survives. As shown in Ref. UBI, this is equivalent to say 
that ideal photodetection feedback is able to transform standard photodetection 
into a quantum non-demolition (QND) measurement of the photon number. In 
this ideal case, a generic Fock state |tt,) is obviously preserved for an infinite 
time, since each photon lost by the cavity triggers the feedback loop which, in a 
negligible time, is able to give the photon back through adiabatic transfer. How- 
ever, the photon injected by feedback has no phase relationship with the photons 
already present in the cavity and, as shown by II I I II . this results in phase dif- 
fusion. This means that feedback does not guarantee perfect state protection 
for a generic superposition of number states, even in the ideal condition 77 = 1. 
In fact in this case, only the diagonal matrix elements in the Fock basis of the 
initial pure state are perfectly conserved, while the off-diagonal ones always de- 
cay to zero, ultimately leading to a phase-invariant state. However this does not 
mean that the proposed feedback scheme is good for preserving number states 
only, because the unconventional “square-root of phase diffusion” is much slower 
than the conventional one (described by a double commutator with the number 
operator) . 

In fact the time evolution of a generic density matrix element in the case of 
feedback with ideal photodetection 77 = I is 



Pn,m{t) = exp 




/^n,m(0) ) 



( 12 ) 



while the corresponding evolution in the presence of standard phase diffusion is 



Pn,m(t) = exp 





(13) 



Since 

{n — rriY > iy/n — y/m)^ = — ^ ^ \/ n,m (14) 

each off-diagonal matrix element decays slower in the square root case and this 
means that the feedback-induced unconventional phase diffusion is slower than 
the conventional one. 

A semiclassical estimation of the diffusion constant can be obtained from the 
time evolution of the mean coherent amplitude {a(t)). In fact, phase diffusion 
causes a decay of this amplitude as the phase spreads around 27 t, even if the 
photon number is conserved. In the presence of ordinary phase diffusion the 
amplitude decays at the rate 7/2; in fact 
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(a{t)) = Tr {ap{t)} = ^ Vn+ lpn+i,n{t) , (15) 

n—0 

and using Eq. (d one gets 

{a{t)) = e-^‘/2(a(0)) . 



In the case of the square root of phase diffusion, Eqs. (EJ and d instead yield 



(a(t)) = Tr{a(t)p(0)} , (16) 

where the Heisenberg-like time evolved amplitude operator a(t) is given by 

(17) 



a{t) = exp "I ~ ^ aal — V ala^ a . 

In the semiclassical limit it is reasonable to assume a complete factorization 
of averages, so to get 

(a(t)) = exp Y (Vh + 1 - Vh) | (a(0)) , (18) 

which, in the limit of large mean photon number n, yields 



{a{t)) = exp<{ (a(0)) . 



(19) 



This slowing down of phase diffusion (similar to that taking place in a laser well 
above threshold) means that, when the feedback efficiency rj is not too low, the 
“lifetime” of generic pure quantum states of the cavity field can be significantly 
increased with respect to the standard case with no feedback (see En. ll ill . 



3 Optical Feedback Scheme for the Protection of Qubits 

Photon states are known to retain their phase coherence over considerable dis- 
tances and for long times and for this reason high-Q optical cavities have been 
proposed as a promising example for the realization of simple quantum circuits 
for quantum information processing. To act as an information carrying quantum 
state, the electromagnetic fields must consist of a superposition of few distin- 
guishable states. The most straightforward choice is to consider the superposition 
of the vacuum and the one photon state a|0) -|- /3|1). However it is easy to un- 
derstand that this is not convenient because any interaction coupling |0) and 
|1) also couples |1) with states with more photons and this leads to information 
losses. Moreover the vacuum state is not easy to observe because it cannot be 
distinguished from a failed detection of the one photon state. A more convenient 
and natural choice is polarization coding, i.e., using two degenerate polarized 
modes and qubits of the following form 

1^) = -I- |0) = a|0, 1) -I- /3|1, 0) , 



( 20 ) 
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in which one photon is shared by the two modes m- In fact this is a “natural” 
two-state system, in which the two basis states can be easily distinguished with 
polarization measurements; moreover they can be easily transformed into each 
other using polarizers. 

Polarization coding has been already employed in one of the few experimental 
realization of a quantum gate, the quantum phase gate realized at Caltech jS|. 
This experiment has demonstrated conditional quantum dynamics between two 
frequency-distinct fields in a high-finesse optical cavity. The implementation of 
this gate employs two single-photon pulses with frequency separation large com- 
pared to the individual bandwidth, and whose internal state is specified by the 
circular polarization basis as in (EUll . The conditional dynamics between the two 
fields is obtained through an effective strong Kerr-type nonlinearity provided by 
a beam of cesium atoms. This conditional dynamics of the quantum gate has 
to be unitary with a high degree of accuracy during the operation time, i.e., 
decoherence has to be negligible; the experiment of Ref. 0 has been performed 
in the bad cavity limit, in which the main dissipative effects and main source of 
decoherence is the photon leakage outside the cavity. It is therefore quite natural 
to see if the atomic feedback scheme described in detail above is able to protect 
the “flying” qubits of Ref. |^. To be more specific, we shall not be concerned with 
the protection of the quantum gate dynamics, but we shall focus on a simpler 
but still important problem: protecting an unknown input state for the longest 
possible time against decoherence. We shall therefore consider a single qubit, 
i.e., a single frequency whose internal state is specified by the polarization. 

One has to apply an adiabatic transfer feedback loop as that of Fig. 1 to 
each polarized mode independently. This can be done using polarization-sensitive 
detectors (for example a polarized beam splitter and two detectors) and two 
classical laser fields with opposite circular polarization. In this way one has two 
similar feedback loops where one polarized mode is involved in the transition 
l^i) — !■ 1 ^ 2 )) and the other mode participates to the reversed transition. In this 
way each mode gets a photon with the right polarization. A schematic description 
of the scheme is given by Fig. 2. 

For a quantitative characterization of how the feedback scheme is able to 
protect an initial pure state we study the fidelity F{t) 

F{t) = Tr{p{0)p{t)} (21) 

i.e., the overlap between the final and the initial state p(0) after a time t. In 
general 0 < F{t) < 1. For an initially pure state |?/'(0)), F{t) is in fact the 
probability to find the system in the initial state at a later time. A decay to 
an asymptotic limit is given by the overlap {il){0)\p{oo)\'ip(0)) . Since the input 
state we seek to protect is unknown, the protection capabilities of the feedback 
scheme are better characterized by the minimum fidelity, i.e., the fidelity of 
Eq. (EB minimized over all possible initial states. Moreover we shall consider a 
class of initial states more general than those of Eq. lETTIi . i.e.. 



\ip) = a\n, m) + /3|m, n) . 



( 22 ) 
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Since the two polarized modes evolve independently, one has to solve the 
master equation (HU to calculate the fidelity. This can be done easily and one 
gets the following expression for the minimum fidelity 



± rr 



1 ^g-(l-7))7i("+"i) _|_ g-7t("+™-2r;Vnm)^ 



(23) 



classical laser fields 




Fig. 2. Adaptation of the feedback scheme of Fig. 1 to the polarization coding 
case; there is a feedback loop for each circularly polarized mode and the two 
loops are separated by the polarized beam splitter PBS. 



In the absence of feedback (77 = 0), this expression becomes Fmin{t) = 
exp{— 7 t(n+m)} showing that in this case, the states most robust against cavity 
damping are those with the smallest number of photons, m + n = 1 , i.e., the 
states of the form of Eq. (121 )ll . Moreover, in a typical quantum information pro- 
cessing situation, one has to consider small qubit “storage” times t with respect 
to 7 “^ so to have reasonably small error probabilities in quantum information 
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storage. Therefore the protection capability of an optical cavity with no feedback 
applied is described by 

= 1 - . (24) 

If we now consider the situation in the presence of feedback (Eq. 1I2.SII 1. it is 
possible to see that, for fixed, non-unit efficiency rj, the best protected state are, 
as in the no-feedback case, the states with only one photon a|0, 1) -I- /3|1, 0) and 
therefore the corresponding minimum fidelity for 77 < 1 is given by 

i . (25) 

This shows that feedback increases the “lifetime” of a generic qubit state with 
respect to the no-feedback case, even if, in this non-ideal case, one has a scaling 
of the error probability by a factor (1 — rj/2) only. 

It is interesting to consider the ideal case 77 = 1, even if it is not realistic, 
because in this case the situation changes qualitatively. In fact Eq. (E) becomes 

Frmnit) = ^ (l + . (26) 

SO that it is easy to see that in this case it becomes convenient to work with a 
large number of photons and that the best protected qubit states are those of 
the form 

It/j) = a|n, n-l- 1) -l-/3|n-|- l,n) (27) 

whose corresponding minimum fidelity is 

Therefore, in the ideal photodetection case and using qubits of the form of 
the probability of errors in the storage of quantum information can be made 
arbitrarily small. 

The feedback method proposed here to deal with decoherence in quantum 
information processing is different from most of the proposals made in this re- 
search field, which are based on the so called quantum error correction codes 
E3’ In our case, feedback allows a physical control of decoherence, through a 
continuous monitoring and eventual correction of the dynamics and in this sense 
our approach is similar in spirit to the approach of Ref. I2321- Quantum error 
correction is instead a way to use software to preserve linear superposition states. 
Essentially in these approaches the entangled superposition state of I qubits is 
“encoded” in a larger number of qubits n, so that, assuming that only a frac- 
tion of qubits t/n decoheres, it is possible to reconstruct the original state with 
a suitable decoding procedure. However, due to existence of a lower (quantum 
Hamming) and an upper (quantum Gilbert-Varshamov) bound for the “code 
rate” l/n |2S|, these quantum error correction codes can be applied and become 
efficient only at sufficiently small probability of error t/n. For this reason, even if 
under realistic conditions our feedback scheme achieves only a moderate reduc- 
tion of the error probability, it could be useful when used in conjunction with 
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quantum error correction techniques. The feedback scheme would realize the 
preliminary reduction of the error probability, which is necessary for an optimal 
implementation of efficient error correction schemes. 
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Adiabatic Controlled-NOT Gate for Quantum 

Computation 
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Abstract. A new variant of the controlled-NOT quantum logic gate is 
proposed. The gate is based on adiabatic level-crossing dynamics of the 
q-bits. An important advantage of the adiabatic dynamics, as opposed 
to the ac-driven Rabi transitions, is its considerable insensitivity to the 
unavoidable spread of the gate parameters. The gate has a natural im- 
plementation in terms of the Cooper pair transport in arrays of small 
Josephson tunnel junctions. The decoherence rate for this implementa- 
tion is estimated. 



1 Introduction 

The invention of quantum algorithms and quantum information processing (see, 
e.g., the reviews P) changed the foundations of the theoretical computer science 
by demonstrating that the information is processed differently by quantum and 
classical systems. In an ideal world, a quantum algorithm implemented on a 
quantum computer can radically outperform the classical algorithm by making 
use of quantum parallelism inherent in entangled quantum states. Examples 
of problems which can be efficiently solved with quantum algorithms include 
factorization of large numbers [2j and database search 0. 

Practical realization of a quantum computer requires, however, very precise 
and reversible time evolution of complex quantum mechanical systems, the fact 
that gives rise to serious doubts 0 as to whether even the simplest version of a 
quantum computer will ever become a reality. It is therefore important to look 
into various possible ways of implementing simple elements of quantum com- 
puter - quantum logic gates in order to find the optimal approach to building 
such a computer. Generally speaking, a quantum gate should satisfy two contra- 
dictory requirements: being isolated from the outside world in order to maintain 
quantum coherence, and interacting with other q-bits, read-out system, etc., in 
order to perform a meaningful computation. Existing quantum gate proposals 
use various systems including ion traps m, electrodynamic cavities |Z], semi- 
conductor quantum dots |iSI9] . NMR spectroscopy P33> quantum flux dynamics 
in SQUIDs dH. Some of these proposals, for instance, ion-trap or NMR, are 
characterized by potentially very long relaxation times, since the gates in these 
proposals are well isolated physically from the outside world. However, due to 
the very same reason, these gates can not be combined easily to form larger com- 
puting systems. For other gates, for instance, based on semiconductor quantum 
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dots, the situation is the opposite. In principle, it is not too difficult to integrate 
them into larger structures, but there seems to be very little hope of reducing 
decoherence rates to a level acceptable for quantum computation. 

The aim of this work is to suggest another possible realization of quantum 
gates which is based on manipulation of the charge states of small Josephson 
tunnel junctions. This approach combines both the potential for relatively long 
relaxation times and large degree of design flexibility, and probably represents 
one of the best, if not the best, hope for realization of a quantum computer of 
medium complexity. Such a computer, while not being sufficient for factorization 
of large numbers of practical interest, could be sufficiently complex to perform 
privacy amplification in quantum communication channels. 

The basic universal set of quantum logic gates consists of the one q-bit gates 
and two q-bit controlled-NOT (CN) gate. In practically any implementation, 
including the one discussed below, the dynamics of the two q-bit gates contains 
all elements of the one q-bit operation, and therefore, we can concentrate on the 
two q-bit CN gate. The operation of this gate can be described simply as inversion 
of the target q-bit states when the control q-bit is the “1” state. The state of 
the control q-bit should be unchanged during this operation. Typically, the CN- 
operation is achieved through the use of the ac-driven Rabi transition between 
the q-bit states 1 51 till 01 . This approach becomes very problematic for solid state 
implementations of the quantum gates because of the unavoidable variations 
of parameters from gate to gate in solid state structures. We propose another 
general scheme of the CN gate which uses adiabatic transitions between the 
q-bit states and is more suitable for implementation in systems of small tunnel 
Josephson junctions. Although adiabatic approach does not solve completely the 
problem of parameter variations it makes it less severe. 



2 Adiabatic CN-Gate 

The main idea of the adiabatic CN-gate is as follows. Interaction between the 
control and target q-bit makes the energy difference between the two states of 
the target q-bit dependent on the state of the control q-bit. If the control q-bit is 
in the state “1” of the computational basis, the energy difference is smaller and 
under application of the time-dependent bias the target q-bit passes through the 
level-crossing point, where the energies of its two states are equal - see Fig. 1. 
If the rate of the bias increase is sufficiently small, the two states of the target 
q-bit exchange their occupation probabilities. When the control q-bit is in the 
“0” state, the energy difference is larger and the same bias pulse does not drive 
the target q-bit through the level-crossing point. In this case, the occupation 
probabilities remain the same. The tunnel coupling between the states of the 
control q-bit is suppressed during the entire process so that their occupation 
probabilities do not change in either case. This time evolution realizes CN-gate 
operation provided that the parameters of the two q-bits are chosen in such a 
way that the dynamic phases accumulated in the system evolution along all four 
“paths” are equal. 
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Fig. 1. Time evolution of the energy levels of a controlled-NOT quantum logic 
gate based on the adiabatic level-crossing dynamics. 

To implement this dynamics of the two q-bit system we need to control both 
the energy difference ej{t) between the two states of a q-bit in the computa- 
tional basis |0) and |1), and the tunneling amplitude f 2 j{t) in this basis, i.e., the 
Hamiltonian of the system should be: 



where aj are the Pauli matrices for the jth q-bit, and rj is the energy of interac- 
tion between the q-bits. Although the basic gate dynamics does not require the 
modulation of the interaction strength 77, this interaction does not allow to make 
the energies of all four gate states equal after the gate operation. This means 
that the relative phases of these states will continue to evolve at a rate on the 
order oirf/Ti and we need to be able to manipulate the gate on a time scale much 
shorter than h/rj, which in its turn should be much shorter than the time scale 
of the adiabatic dynamics. This long hierarchy of time scales presents a serious 
problem that exists for other proposals of quantum gates as well. A better so- 
lution is to design a gate in a way that allows to switch the interaction on and 
off, despite the fact that this makes the design appreciably more complicated. 

If the interaction energy rj in the Hamiltonian can be controlled, we can 
separate the gate dynamics into three steps. At first, the two q-bits are brought 
into contact by switching on rj and f?2 (f?i is completely suppressed throughout 
the gate operation). Simultaneously the energy difference £2 between the states 



i? = ^ + f2j(t)axj) -k Wzicrz2 , 

i=i.2 



( 1 ) 
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of the target q-bit is set to some nonvanishing initial value. Then this energy 
difference is increased while all other energies are kept constant. This step is the 
central “level-crossing” part of the gate dynamics. During the final third step 
both rj and f ?2 are suppressed back to zero so that the two q-bits are effectively 
separated and £2 can also be reduced to zero. 

The precise functional dependence of f? 2 , £ 2 , and t] on time does not qual- 
itatively affect the gate dynamics, as long as all these parameters are changed 
gradually. The limitation on the rate of the parameter variations is associated 
with the unwanted transitions between the instantaneous energy eigenstates of 
the system which are brought about by these variations. These transitions vio- 
late the correct adiabatic dynamics which assumes that the system remains at 
all times in the same eigenstate it occupied initially. Adopting a simple model 
time dependence of the energy difference £ 2 (t): 

£ 2 (t) = £ -I- utanh(t/r) , (2) 

and using the standard quasiclassical approach we can calculate explicitly the 
probability p that the system makes an unwanted transition during the central 
second step of the gate operation. This simple calculation confirms the expected 
result that the probability p reaches maximum when the system passes through 
the level crossing-point and is given then by the standard Landau-Zener expres- 
sion: 

PLZ = exp{ ^1- (3) 

Here 17 is the magnitude of the tunnel amplitude 172(f) that is kept constant 
during this step of the gate operation. Thus, the condition p^z 1, i.e., r 
hu/ fP, ensures the correct adiabatic dynamics of the gate. 

This implies that dynamics of the occupation probabilities of the states of 
adiabatic CN gate is not sensitive to the precise values of the parameters in the 
Hamiltonian Q provided that they satisfy several constrains which ensure the 
gate operation shown in Fig. 1: 

r] + u — e:^fi, ry — It — £ <C — f7 , u — p — e<^—fi. (4) 

If all these conditions are satisfied, the evolution of the absolute values of the 
occupation amplitudes of the four gate states corresponds to the correct CN 
operation: 

\oi^j\ Woj\ I |aio| — *■ |aii| , |aii| — *■ l^iol • 

(The indices i and j denote the states of the control and target q-bit respec- 
tively.) Besides this time evolution of the absolute values of the correct gate 
dynamics requires also that phases of the four states accumulated in the process 
of the gate evolution are equal modulo 27 t. This can be achieved by adjusting the 
bias £ 1.2 of the two q-bits and the energy splitting 172 during the gate operation. 
The bias £1 controls the relative phases of the pairs of states evolving from the 0 
and 1 state of the control q-bit, while £2 and 172 control the phases within each 
pair. With such an adjustment of the phases, the adiabatic time evolution of the 
coupled q-bits represents correctly the CN quantum logic gate. 
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3 Small Josephson Junctions as Q-Bits 



The adiabatic CN-gate can be naturally implemented in systems of small Joseph- 
son tunnel junctions in the Coulomb blockade regime - see, e.g., [ 1 41 1 b) . The 
energy diagram of an elementary building block of such a system, a single junc- 
tion, is shown in Fig. 2. The dominant contribution to the junction energy is 
given by the charging energy U{n) of the junction as a capacitor: 



U{n) 



{2en - Qo)^ 
2C 



where C is the junction capacitance, n is the number of Cooper pairs trans- 
ferred across the junction, and Qq is the charge induced by the external bias 
voltage Vo across the junction, Qq = VoC. In general, the states with different 
n’s are separated by large energy gaps on the order of elementary charging en- 
ergy Ec = e^/2C. However, when the external voltage Vq induces the charge of 
approximately one electron on the junction capacitance, Qo ~ e, the two state, 
n = 0 and n = 1 are nearly degenerate and are separated from all other states 
by the large energy gaps - see Fig. 2. In this regime the junction behaves effec- 
tively as a two-level system. The energy difference e between the level of this 
two-level system is controlled by the external voltage e = 2e(e — Qo)/C, while 
the amplitude C of tunneling between them is determined by the Josephson 
coupling energy Ej of the junction, C = Ej /2. The Josephson coupling energy 
depends on the tunnel resistance Rt of the insulator barrier between the elec- 
trodes, and for the electrodes with equal superconducting energy gaps A is equal 
to TTTiA/4,e^RT - see, e.g, [TTij . 




Fig. 2. Energy diagram of a tunnel Josephson junction in the Coulomb blockade 
regime biased with the external voltage that induces the charge Qo — e on the 
junction capacitance. The two states n = 0, 1 are nearly-degenerate and the 
junction behaves effectively as a macroscopic two-level system. 
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Thus, the appropriately biased small Josephson tunnel junction is a macro- 
scopic two-level system, with the two states represented by the position of a 
single Cooper pair on the left or right electrode of the junction. In principle, this 
system can be used as a q-bit of the quantum logic gates. However, if q-bits are 
represented with single junctions, neither the tunneling amplitude Ejj2 nor the 
coupling strength of the two q-bits which is determined by the coupling capaci- 
tance between the junction electrodes can be modulated in time as required by 
the design of the adiabatic CN gate. In particular, to realize adiabatic dynamics 
it should be possible to suppress both the tunneling amplitude and interaction 
strength to zero between the active cycles of the gate operation. This problem 
can be circumvented if q-bits are represented not with individual junctions but 
with the one-dimensional arrays of junctions. In an array, the tunneling ampli- 
tude 12 between the two islands of the array can be effectively modulated by the 
gate voltages applied to the islands of the array, and the interaction energy rj of 
charges in the array decreases exponentially with the distance between them. 

To make a q-bit out of a uniform array, all islands should have individual gate 
electrodes supplying the gate voltages Vj (Fig. 3a,b), and two internal islands of 
the array should be biased with the voltages EejCt, where Ct = (Cq - l-dCCo)^/^ 
is the total capacitance of an internal island in the array - see, e.g., H3|, and 
C, Co are, respectively, the junction capacitance, and the capacitance between 
each island and its gate electrode (Fig. 3b). These voltages induce the charges 
e and — e on the two islands, so that the two charge configurations of the array: 
one with no Cooper pair transferred across any junction and another one with 
a Cooper pair transferred between the two biased islands, from e to — e, have 
the same energy. This means that if the bias conditions do not deviate strongly 
from these conditions, all other charge configurations of the array have much 
larger energies and the array dynamics is equivalent to the two-state dynamics 
that can be described in terms of the tunneling of a single Cooper pair between 
the two islands. In this regime the array can be viewed as a q-bit with the two 
positions of the Cooper pair on one or another island representing the two states 
of the computational basis of this q-bit. 

If the two islands containing the q-bit states are separated by m junctions, the 
amplitude of tunneling 17 between them depends exponentially on the separation 
m. The dominant contribution to 17 comes from the process in which the Cooper 
pair is transferred sequentially through the junctions separating the islands, and 
can be written as: 



7^ 1 p 



fc=i 



2Et ’ 



( 5 ) 



where Ek are the energies of the intermediate charge configurations resulting 
from the Cooper pair transfer through the first k junctions. These energies are 
controlled by the gate voltages applied to the intermediate islands. 

The most important feature of the Cooper pair states forming q-bit basis 
is that they can be moved along the array by the adiabatic level-crossing tran- 
sitions similar to those discussed above. A Cooper pair is transferred between 
the two adjacent islands when a gate voltage of the initially occupied island 
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(a) 





Fig. 3. Schematic layout (a) and equivalent electrostatic circuit (b) of an array 
of small Josephson junctions representing one q-bit. (c) The controlled-NOT 
gate obtained by coupling of the two arrays. 



is increased/decreased while the gate voltage of the neighboring island is de- 
creased/increased adiabatically past ejCt- The adjacent islands are coupled by 
the tunneling amplitude ifj/2, and the Cooper pair is transferred with the prob- 
ability exponentially approaching one if the rate of change of the gate voltages 
is small on the scale of this amplitude. Similar manipulation of the gate voltages 
also shifts the empty state of the q-bit by one island. In this way it is possible 
to move the q-bit states around, either shifting both states along the array, or 
changing the separation m between the two states. 

This dynamics is analogous to the one used in the so-called single-electron 
fTTH and single Cooper pair pump, or single-electron parametron ^H|. It 
allows to implement the general scheme of the adiabatic CN gate with the two 
coupled arrays representing the two q-bits of the gate (Fig. 3c). As a first step of 
the gate operation, the q-bit states in both arrays are moved towards the ends of 
the arrays where they can interact via the coupling capacitance Ci. The states 
of the controlled q-bit in the first array have sufficiently large separation m so 
that their tunnel coupling fl\ is negligible. By contrast, the states of the target 
q-bit in the second array are put on the adjacent islands in order to maximize 
their tunnel coupling, C 2 = ifj/2. Then a pulse of the bias voltage is applied to 
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the first junction of the target q-bit array. If the control q-bit is in the “1” state, 
a Cooper pair occupies the island of the first array closest to the second array 
and creates additional potential drop SV across the junction of the target q-bit: 



6V = 



8eQ 

(Co + Ct + 2c)(Co + Ct + 4a) ■ 



( 6 ) 



In this case the bias pulse drives the target q-bit through the level-crossing 
point so that the occupation probabilities of its states are interchanged. When 
the control q-bit is in the “0” state, the Cooper pair of this q-bit is inside the 
array and does not produce extra voltage across the target q-bit junction, which 
then does not reach the level-crossing point, and the occupation probabilities 
of its states remain the same. During the last step of the gate operation it is 
returned to its initial configuration, i.e., the separation of the states of the target 
q-bit is increased to suppress the tunnel amplitude to zero, and the states of 
the both q-bits are shifted inside the arrays. Then the interaction of the q-bit 
states becomes negligible due to screening by the gate electrodes, which is known 
to lead to the exponential suppression of the interaction energy 77 between two 
Cooper pairs separated by m junctions of one array m 



V = 



(M! 

Ct 



x = 



2C 



2C + Co + Ct 



(7) 



This implementation of the CN quantum gate can only be practical if it is 
stable against deviations of the real gate structure from the idealized model used 
above. Such deviations are fundamentally unavoidable in all macroscopic real- 
izations of quantum gates. For instance, the real electrostatics of the Josephson 
junction gate is much more complicated that the model characterized by the 
two nearest-neighbor capacitances C and Cq. It involves full capacitance ma- 
trix Cij in which even remote islands interact with each other, and should also 
describe small fluctuations of the nearest-neighbor capacitances around their av- 
erage values. An important advantage of the adiabatic approach is that these 
complications can be compensated for by the adjustment of the bias voltages 
and do not change qualitatively the gate dynamics. Indeed, the adiabatic trans- 
fer of a Cooper pair depends only on the resonance condition that the energies 
of all Cooper pair states along the array are the same, which ensures correct 
transfer of the occupation probabilities of the gate states. The bias voltages can 
always be tuned to satisfy the resonance condition regardless of the form of the 
capacitance matrix. A practical proof of this statement is provided by the ex- 
perimentally demonstrated operation of a similar system, single-electron pump, 
with accuracy better than 10’ -6 [E|. 

The only instance when the gate dynamic relies heavily on the simplified 
model of the array electrostatic is in the assumption of the exponential screen- 
ing of the electrostatic interaction inside the array. In the realistic model of elec- 
trostatics, interaction at large distances depends on the external environment of 
the array. The exponential screening of the interaction can still be obtained even 
in this case, but requires that the array is placed between the two conducting 
ground planes. 
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These considerations show that dynamics of the occupation probabilities of 
the gate states is indeed insensitive to the week disorder in the gate parameters. 
However, the proper dynamics of the system as quantum logic gate requires 
also that the phases of the occupation amplitudes accumulated during the gate 
operation are all equal modulo 27 t. In this respect, fluctuations of the junction 
parameters do present a problem since they make the dynamic phases of the 
gate states unpredictable. This problem can be resolved if the disorder in the 
parameters is static on a sufficiently long time scale. In this case, the phases can 
be measured and compensated for by the fine-tuning of the gate voltages. 

In order to measure the phases, we need to transform them into the occupa- 
tion probabilities of the gate states which in their turn can be measured with 
a singe-electron electrometer (see, e.g., |23, Chapter 9). An electrometer mea- 
sures an average charge of the island and therefore gives information about the 
occupation probabilities of the gate state, but is insensitive to the phase of the 
occupation amplitudes. Suppose that as a result of a prior measurements, we 
know that the occupation probabilities of the two q-bit states are p\ and p 2 ■ 
The two states are decoupled (the corresponding tunneling amplitude 17 is zero) 
and their energies are equal, so that there is some stationary phase difference 
ip between their occupation amplitudes. The phase ip can be transformed into 
the occupation probability by rotation U of the q-bit states in the Hilbert state, 
U = exp{i7T(Ta,/4}. This rotation is achieved if the barrier between the states is 
reduced temporarily in such a way that 

J dtQ{t) = -Kh/A . 

The resulting occupation probabilities 

qi,2 = 1/2=F (piP2)^^^sinv?, 

depend on the phase p, and by measuring them we can measure ip. After the 
phase is known it can be compensated for by adding an extra voltage pulses at 
the end of the gate operation. With this fine-tuning, the gate dynamics becomes 
effectively independent of the static disorder in the gate parameters. 

4 Decoherence Rate and Parameter Estimates 

The above discussion assumes that the energy relaxation and associated with it 
time-dependent fluctuations of the phase are negligible. There are several dissipa- 
tion and dephasing mechanisms in the Josephson tunnel junction systems. Some 
of them are well understood and can be controlled within certain limits. One 
of these mechanisms is the quasiparticle tunneling. In general, it coexists with 
the Cooper pair tunneling and makes junction dynamics irreversible. However, 
if both the temperature T and charging energy Eq of the junctions are much 
smaller than the superconducting energy gap A, the quasiparticle tunneling is 
suppressed by the parity effects to a level where it can be negligible on 
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the macroscopic time scales mrm . Another dissipation mechanism is coupling 
to the electromagnetic excitations supported by the system of superconducting 
electrodes. A Cooper pair oscillating between the two islands creates oscillat- 
ing currents in the islands and electric fields around the islands which couple to 
these modes. The power P lost to electromagnetic modes depends on the specific 
geometry of the islands and connecting them tunnel junctions. Part of the losses 
comes from the direct dipole radiation from the junctions and can be estimated 
as radiation of a dipole of length equal to the length d of the junction electrodes: 



Pd 



dTreoc^ 



( 8 ) 



The radiated power is not exponentially small, nevertheless it decreases suffi- 
ciently rapidly with decreasing ratio of the island size to the radiation wave- 
length A ~ c/w at frequency lo ~ Ej/h. Therefore, to keep this type of radiation 
losses small the islands of the junction arrays should be much smaller than the 
wavelength at frequency Ej/fi, the condition that is always satisfied in small 
junctions. 

The crucial contribution to radiation losses comes from the coupling to elec- 
tromagnetic modes supported by essentially “infinite” external gate electrodes 
supplying bias voltages to the islands. In the relevant regime with Co ^ C, 
the power dissipated into these modes can be estimated in terms of the wave 
impedance p of the gate electrodes as 



(9) 

We see that this dissipation mechanism limits the magnitude of the island capac- 
itance to the gate electrodes Cq. In the simple model of the gate electrostatics, 
Co determines also the number of islands of the junction array that are polar- 
ized by a single Cooper pair, and restriction on Co translates into the limitation 
on how small the number of junctions in the arrays can be. If however, one in- 
troduces ground planes which give rise to extra stray capacitances of the array 
islands, these two limitations becomes uncoupled. In any case, for realistic val- 
ues of the parameters (see the estimates below) the losses (EJ in the external 
electrodes should give the dominant contribution to decoherence for the Cooper 
pair tunneling. 

Besides these “controllable” mechanisms of dissipation that depend on the 
gate geometry, the Cooper pair tunneling in the junction arrays is affected also 
by the “internal” dissipation in all elements of the arrays. The most important 
source of noise and dissipation of this kind is the 1// charge noise in the in- 
sulators surrounding the junctions: substrate and tunnel barriers. The strength 
of the noise is material dependent and can not be estimated from first princi- 
ples. Experimentally, characteristic time scale of the charge noise varies from 
millisecond range m to seconds and hours 123, and is much longer that charac- 
teristic time of the Cooper pair tunneling %/ Ej which determines the rate of the 
gate operation. Therefore, the gate can go through the large number of cycles 
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of operation before the decoherence due to the charge noise starts to affect its 
dynamics. 

In conclusion, we summarize the conditions that should be satisfied by junc- 
tion arrays in order to operate as quantum logic gates. The first set of conditions 
is given by the following string of inequalities: 

T Ej Ec A. (10) 

The two limiting energy scales in this relations, temperature T and energy gap A, 
are practically constrained by the available refrigeration technology and super- 
conducting materials. The lower limit is set by the typical electron temperature 
attainable in experiments with the dilution refrigerator and is on the order of 30 
mK. The upper limit can not be much larger than the energy gap of niobium, 
or its compounds, i.e., about 20 K. The ratio of the Josephson coupling energy 
Ej to the charging energy Ec can not be varied arbitrarily because of the tech- 
nological limitations on the critical current density that can be obtained while 
preserving the quality of the tunnel junction. Conditions li I oil are satisfied if we 
take, for instance, Ej ~ 1 K, and Ec ~ 3 K. This value of Ec corresponds to 
the junction capacitance C ~ 0.5 fF, which for a typical specific capacitance of 
a tunnel junction, 0.1 pF //rm^, requires the junction area of about 70 x 70 nm^. 
With this area, the cited Ej value corresponds to the critical current density jc 
about 10 and the total critical current Ic = 2eEj/h ~ 50 nA. Exper- 

imentally, this value of jc is within the range of current densities that can be 
achieved without the degradation of the tunnel junction quality m- 

Another condition on the junction array as a CN gate is that the number N 
of junctions in it is much larger than its screening length: 



Here C* is the total stray capacitance of the array islands which include ca- 
pacitance Co to the gate electrodes and capacitance to the ground planes. This 
condition does not represent a serious obstacle to realization of a CN gate. Spe- 
cific values of N and C* are dictated by the convenience of fabrication of either 
longer arrays or larger capacitances to the ground. 

The most difficult is the condition that the probability a of the decoherence- 
induced error during one cycle of the gate operation is small. Estimating the 
period of this cycle roughly as Ti/Ej we obtain from eq. (0) that the lower 
bound on a is: 



Co 

^ C’ h ' 



( 11 ) 



The values of parameters that are typical for existing experiments (in which 
no effort was maid to decrease a) are Cq/C ~ 0.1, and p ~ 300 Ohm |2H|- 
(The latter value corresponds to a narrow, about 1pm, electrode.) In this case 
a ~ 10“^. The error probability can be substantially reduced by making coupling 
capacitance Cq smaller, and gate electrodes wider thus decreasing p. Although 
only experiments can tell what is the limit to decrease in decoherence rate, it is 
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reasonable to expect that a can be reduced further by a few orders of magnitude 
to a value about 10“®. 

The author thanks J.P. Pekola and K.-A. Suominen for critical reading of 
the manuscript. This work was supported by ONR. 
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Abstract. We briefly review the development and theory of an exper- 
iment to investigate quantum computation with trapped calcium ions. 
The ion trap, laser and ion requirements are determined, and the param- 
eters required for simple quantum logic operations are described. 
(LAUR 98-314) 



1 Introduction 



In the last 15 years various authors have considered the generalization of infor- 
mation theory concepts to allow the representation of information by quantum 
systems. The introduction into computation of quantum mechanical concepts, in 
particular the superposition principle, opened up the possibility of new capabili- 
ties, such as quantum cryptography Q, that have no classical counterparts. One 
of the most interesting of these new ideas is quantum computation, first pro- 
posed by Benioff 0. Feynman 0 suggested that quantum computation might 
be more powerful than classical computation, a notion which gained further 
credence through the work of Deutsch ^ . However, until quite recently quan- 
tum computation was an essentially academic endeavor because there were no 
quantum algorithms that exploited this power to solve useful computational 
problems, and because no realistic technology capable of performing quantum 
computations had been envisioned. This changed in 1994 when Shor discovered 
quantum algorithms for efficient solution of integer factorization and the discrete 
logarithm problem ISEl, two problems that are at the heart of the security of 
much of modern public key cryptography Later that same year Girac and 
Zoller proposed that quantum computational hardware could be realized using 
known techniques in the laser manipulation of trapped ions |E] • Since then inter- 
est in quantum computation has grown dramatically, and remarkable progress 
has been made: a single quantum logic gate has been demonstrated with trapped 
ions jOj; quantum error correction schemes have been invented mUD; several 
alternative technological proposals have been made [2 1 and quan- 
tum algorithms for solving new problems have been discovered [IMlYllbiro] . In 
this paper we will review our development of an experiment to investigate the 
potential of quantum computation using trapped calcium ions unj. 
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The three essential requirements for quantum computational hardware are: 
(1) the ability to isolate a set of two- level quantum systems from the environment 
for long enough to maintain coherence throughout the computation, while at the 
same time being able to interact with the systems strongly enough to manipulate 
them into an arbitrary quantum state; (2) a mechanism for performing quantum 
logic operations: in other words a “quantum bus channel” connecting the various 
two-level systems in a quantum mechanical manner; and (3) a method for reading 
out the quantum state of the system at the end of the calculation. 



Z 




Fig. 1. A schematic illustration of an idealized laser-ion interaction system; 
is the wave vector of the single addressing laser. 



All three of these requirements are in principle met by the cold trapped ion 
quantum computer. In this scheme each qubit consists of two internal levels of an 
ion trapped in a linear configuration. In order to perform the required logic gates, 
a third atomic state known as the auxiliary level is required. The quantum bus 
channel is realized using the phonon modes of the ions’ collective oscillations. 
These quantum systems may be manipulated using precisely controlled laser 
pulses. Two distinct types of laser pulse are required: “V” type pulses, which 
only interact with the internal states of individual ions, and “U” type pulses 
which interact with both the internal states and the external vibrational degrees 
of freedom of the ions. These interactions can be realized using Rabi flipping 
induced by either a single laser or Raman (two laser) scheme (Fig. 2). Readout 
is performed by using quantum jumps. This scheme was originally proposed by 
Cirac and Zoller in 1994 |^, and was used to demonstrate a CNOT gate shortly 
afterwards |^. 

As we can only give the briefest of description of the principles of quan- 
tum computation using cold trapped ions, the reader is recommended to peruse 
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Fig. 2. A schematic illustration of (a) single laser and (b) Raman qubit address- 
ing and control techniques. 



the more detailed descriptions which can be found elsewhere [ 1 211 :tl 1 4ITH] . In 
this paper we intend to focus on the experimental issues involved in building a 
trapped ion quantum computer. 

2 Choice of Ion 

There are three requirements which the species of ion chosen for the qubits of 
an ion trap quantum computer must satisfy: 

1. If we use the single laser scheme, the ions must have a level that is suf- 
ficiently long-lived to allow some computation to take place; this level can also 
be used for sideband cooling. 

2. the ions must have a suitable dipole-allowed transition for Doppler cooling, 
quantum jump readout and for Raman transitions (if we chose to use two sub- 
levels of the ground state to form the qubit); 

3. These transitions must be at wavelengths compatible with current laser 
technology. 

Various ions used in atomic frequency standards work satisfy the first require- 
ment. Of these ions, Ca^ offers the advantages of transitions that can be accessed 
with titanium-sapphire or diode lasers and a reasonably long-lived metastable 
state. The relevant energy levels of the A = 40 isotope are shown in fig. 3. 
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Fig. 3. The lowest energy levels of ions, with transition wavelengths and 

lifetimes listed. 



The dipole-allowed transition from the 4^S'i/2 ground state to the 4^Pi/2 
level with a wavelength of 397 nm can be used for Doppler cooling and quantum 
jump readout; The 732 nm electric quadrupole transition from the 4 ^S' 1/2 ground 
state to the 3 ‘^D ^/2 metastable level (lifetime « l.OSsec.) is suitable for sideband 
cooling. In the single laser computation scheme, the qubits and auxiliary level 
can be chosen as the electronic states 

\0) = \4^S,/2,M, = 1/2), 

|1) = 13 2^5/2, M, = 3I2), 

\aux) = |3^D5/2, Mj = -1/2). 

This ion can also be used for Raman type qubits, with the two Zeeman 
sublevels of the 4 ^S'i/ 2 ground state forming the two qubit states |0) and |1), 
with one of the sublevels of the 4 ^Ti /2 level being the upper level |2). A magnetic 
field of 200 Gauss should be sufficient to split these two levels so that they can be 
resolved by the lasers. The pump and Stokes beams would be formed by splitting 
a 397nm laser into two, and shifting the frequency of one with respect to the 
other by means of an acousto-optic or electro-optic modulator. This arrangement 
has a great advantage in that any fluctuations in the phase of the original 397nm 
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laser will be passed on to both the pump and Stokes beams, and will therefore be 
canceled out, because the dynamics is only sensitive to the difference between the 
pump and Stokes phases. One problem in realizing the Raman scheme in Ca'^ 
is the absence of a third level in the ground state that can act as the auxiliary 
state \aux) required for execution of quantum gates. This difficulty could be 
removed by using the alternative scheme for quantum logic recently proposed 
by Monroe et al. alternatively, one could use an isotope of Ca~^ which has 
non-zero nuclear spin, thereby giving several more sublevels in the ground state 
due to the hyperfine interaction; other possibilities that have been suggested 
for an auxiliary state with in the Raman scheme are to use a state of a 

phonon mode other than the CM mode or one of the sublevels of the 
doublet 



3 The Radio Frequency Ion Trap 



Radio-frequency (RF) quadrupole traps, also named “Paul traps” after their 
inventor, have been used for many years to confine electrically charged particles 
p(| (for an introduction to the theory of ion traps, see refs. [,3H32] b The classic 
design of such a Paul trap has a ring electrode with endcap electrodes above 
and below, with the ions confined to the enclosed volume. A single ion can be 
located precisely at the center of the trap where the amplitude of the RF field 
is zero. But when several ions are placed into this trapping field, their Coulomb 
repulsion forces them apart and into regions where they are subjected to heating 
by the RF field. For this reason in our experiment ions are confined in a linear 
RF quadrupole trap m- Radial confinement is achieved by a quadrupole RF 
field provided by four 1 mm diameter rods in a rectangular arrangement. Axial 
confinement is provided by DC voltages applied to conical endcaps at either end 
of the RF structure; the endcap separation is 10 mm. The design of the trap 
used in these experiments is shown in diagrammatically in Fig. 4. 

The main concerns for the design are to provide sufficient radial confinement 
to assure that the ions form a string on the trap axis after Doppler cooling; 
to minimize the coupling between the radial and axial degrees of freedom by 
producing radial oscillation frequencies significantly greater than the axial oscil- 
lation frequencies; to produce high enough axial frequencies to allow the use of 
sideband cooling [35 ; and to provide sufficient spatial separation to allow indi- 
vidual ions to be addressed with laser beams. 

4 Laser Systems 

The relevant optical transitions for Ca~^ ions are shown in Fig. 5. There are 
four different optical processes employed in the quantum computer; each places 
specific demands on the laser system. 

The first stage is to cool a small number of ions to their Doppler limit in the 
ion trap, as shown in Fig. 5a. This requires a beam at 397 nm, the 4 ^S' 1 / 2 — 4 ^Pi /2 
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Fig. 4. Side view diagram of the linear RF trap used to confine Ca^ ions in 
these experiments. The endcap separation is 10 mm and the gap between the 
RF rods is 1.7 mm. 



resonant transition. Tuning the laser to the red of the transition causes the ions to 
be slowed by the optical molasses technique m- In this procedure, a laser beam 
with a frequency slightly less than that of the resonant transition of an ion is used 
to reduce its kinetic energy. Owing to the Doppler shift of the photon frequency, 
ions preferentially absorb photons that oppose their motion, whereas they re- 
emit photons in all directions, resulting in a net reduction in momentum along 
the direction of the laser beam. Having carefully selected the trap parameters, 
many cycles of absorption and re-emission will bring the system to the Lamb- 
Dicke regime, leaving the ions in a string-of-pearls geometry. We have recently 
found ion crystals of up to five Ca'^ ions. 

In order to Doppler cool the ions, the demands on the power and linewidth 
of the 397 nm laser are modest. The saturation intensity of Ca^ ions is ~ 
10 mW/cm^, and the laser linewidth must be less than ^ 10 MHz. An opto- 
galvonic signal obtained with a hollow cathode lamp suffices to set the frequency. 
We use a Titanium:Sapphire (Ti:Sapphire) laser (Coherent CR 899-21) with an 
internal frequency doubling crystal to produce the 397 nm light. 

During the Doppler cooling, the ions may decay from the 4 ^Pi /2 state to the 
3^I?3/2 state, whose lifetime is ~ Isec. To empty this metastable state, we use 
a second Ti:Sapphire laser at 866 nm. 

Once the string of ions is Doppler cooled to the Lamb-Dicke regime, the 
second stage of optical cooling, sideband cooling, will be used to reduce the 
collective motion of the string of ions to its lowest vibrational level |2S|, illus- 
trated in Fig. 5b. In this regime, a narrow optical transition, such as the 732 
nm 4 ^S'i/ 2 — 3^D3/2 dipole forbidden transition, develops sidebands above and 
below the central frequency by the vibrational frequencies of the ions. The side- 
bands closest to the unperturbed frequency correspond to the CM vibrational 
motion. If ujq is the optical transition frequency and Ux the frequency of the 
CM vibrational motion, the phonon number is increased by one, unchanged, or 
decreased by one if an ion absorbs a photon of frequency loq + ^x, wq or ujq — uJx, 
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a) 



4^P, 



3/2 







Fig. 5. Different transitions between the levels of Ca^ ions required for (a) 
Doppler cooling, (b) Resolved sideband cooling and (c) quantum logic operations 
and readout. The single laser addressing technique has been assumed. 



respectively. Thus, sideband cooling is accomplished by optically cooling the 
string of ions with a laser tuned to ujq — to x- 

The need to resolve the sidebands of the transition implies a much more 
stringent requirement for the laser linewidth; it must be well below the CM mode 
vibrational frequency of ~ (27 t) x 200 kHz. The laser power must also be greater 
in order to pump the forbidden transition. We plan to use a Ti:Sapphire laser 
locked to a reference cavity to meet the required linewidth and power. At first 
glance it would seem that, with a metastable level with a lifetime of Is, no more 
than 1 phonon per second could be removed from a trapped ion. A second laser 
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at 866 nm is used to couple the state to the 3 ^£>3/2 state to reduce the 

effective lifetime of the D state and allow faster cooling times. The transitions 
required for realization of quantum logic gates and for readout, discussed in 
detail in sections 5.2 and 5.3, are shown in Fig. 5c. These can be performed with 
the same lasers used in the Doppler and sideband cooling procedures. 

There are two other considerations concerning the laser systems for quan- 
tum computation which should be mentioned. To reduce the total complexity of 
the completed system, we are developing diode lasers and a frequency doubling 
cavity to handle the Doppler cooling and quantum jump read out. Also com- 
plex quantum computations would require that the laser on the 4 ^5'i/2 — 3 ^£5/2 
computation transition have a coherence time as long as the computation time. 
This may necessitate using qubits bridged by Raman transitions as discussed 
above, which eliminates the errors caused by the phase drift of the laser. 

5 Qubit Addressing Optics 

In order for the Ca~^ ion qubits to be useful for actual calculations, it will be 
necessary to address the ions in a very controlled fashion. Our optical system 
for qubit addressing is shown schematically in fig. 6. 



Iniegraiing 
detector 
(to monitor 




beams 
(for cooling, 
pump-out etc.) 



Fig. 6. Illustration of the laser beam control optics system. 



There are two aspects to be considered in the design of such a system: the 
precise interactions with a single ion; and an arrangement for switching be- 
tween different ions in the string. In addition to the obvious constraints on laser 
frequency and polarization, the primary consideration for making exact tt- or 27 t- 
pulses is control of the area (over time) of the driving light field pulse. The first 
step toward this is to stabilize the intensity of the laser, as can be done to better 
than 0.1%, using a standard “noise-eater”. Such a device typically consists of an 
electro-optical polarization rotator located between two polarizers; the output 
of a fast detector monitoring part of the transmitted beam is used in a feedback 




434 



D.F.V. James et al. 



circuit to adjust the degree of polarization rotation, and thus the intensity of the 
transmitted light. Switching the light beam on and off can be performed with 
a similar (or even the same) device. Because such switches can possess rise/fall 
times on the scale of nanoseconds, it should be possible to readily control the 
area under the pulse to within ~ 0.1%, simply by accurately determining the 
width of the pulse. A more elaborate scheme would involve an integrating detec- 
tor, which would monitor the actual integrated energy under the pulse, shutting 
the pulse off when the desired value is obtained. 

Once the controls for addressing a single ion are decided, the means for 
switching between ions must be considered. Any system for achieving this must 
be fast, reproducible, display very precise aiming and low “crosstalk” (i.e. overlap 
of the focal spot onto more than one ion), and be as simple as possible. In 
particular, it is desirable to be able to switch between different ions in the string 
in a time short compared to the time required to complete a given 7r-pulse on one 
ion. This tends to discount any sort of mechanical scanning system. Acousto- 
optic deflectors, which are often used for similar purposes, may be made fast 
enough, but introduce unwanted frequency shifts on the deviated beams. As a 
tentative solution, we propose to use an electro-optic beam deflector, basically 
a prism whose index of refraction, and consequently whose deflection angle, is 
varied slightly by applying a high voltage across the material; typical switching 
times for these devices is 10 nanoseconds, adequate for our purposes. One such 
device produces a maximum deflection of ±9 mrad, for a ±3000V input. The 
associated maximum number of resolvable spots (using the Rayleigh criterion) 
is of order 100, implying that ~ 20 ions could be comfortably resolved with 
negligible crosstalk. 

After the inter-ion spacing has been determined, i.e., by the trap frequencies, 
the crosstalk specification determines the maximum spot size of the addressing 
beam. For example, for an ion spacing of 20 /xm, any spot size (defined here as 
the 1/e^ diameter) less than 21.6 /xm will yield a crosstalk of less than 0.1%, 
assuming a purely Gaussian intensity distribution (a good approximation if the 
light is delivered from a single-mode optical fiber, or through an appropriate spa- 
tial Alter). In practice, scattering and other experimental realities will increase 
this size, so that it is prudent to aim for a somewhat smaller spot size, e.g. 10 
/xm. One consideration when such small spot sizes are required is the effect of 
lens aberrations, especially since the spot must remain small regardless of which 
ion it is deflected on. Employing standard ray-trace methods, we have found 
that the blurring effects of aberrations can be reduced if a doublet /meniscus 
lens combination is used (assuming an input beam size of 3mm, and an effec- 
tive focal length of 30mm). A further complication is that, in order to add or 
remove phonons from the system, the addressing beams must have a component 
along the longitudinal axis of the trap. The addressing optics must accommo- 
date a tilted line of focus, otherwise the intensity at each ion would be markedly 
different, and the crosstalk for the outermost ions would become unacceptable. 
According to ray-trace calculations, adding a simple wedge (of ~ 2°) solves the 
problem and this has been confirmed by measurements using a mock system. 
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Depending on the exact level scheme being considered, it may be necessary 
to vary the polarization of the light. Because the electro-optic deflector requires 
a specific linear polarization, any polarization-control elements should be placed 
after the deflector. The final result is a highly directional, tightly- focused beam 
with controllable polarization and intensity. 

6 Imaging System 

In order to determine the ions’ locations and to readout the result of the quan- 
tum computations, an imaging system is required. Our current imaging system 
consists of two lenses, one of which is mounted inside the vacuum chamber, and 
a video camera coupled to a dual-stage micro-channel plate (MCP) image in- 
tensifler. The first lens with focal length 15 mm collects the light emitted from 
the central trap region with a solid angle of approximately 0.25 sr. The image 
is relayed through a 110mm/f2 commercial camera lens to the front plate of the 
MCP. This set-up produces a magnification of 7.5 at the input of the MCP. The 
input of the 110 mm lens is fitted with a 400nm narrow band Alter to reduce 
background from the IR laser and from light emanating from the hot calcium 
oven and the electron gun filament. 

The dual plate intensifler is operated at maximum gain for the highest pos- 
sible sensitivity. This allows us to read out the camera at normal video rate of 
30 frames s“^ into a data acquisition computer. Averaging and integrating of 
the signal over a given time period can then be undertaken by software. We And 
this arrangement extremely useful in enabling us to observe changes of the cloud 
size or the intensity of the fluorescence with changes of external parameters like 
trapping potential, laser frequency, laser amplitude, etc. in real time. 

The spatial resolution of the system is limited by the active diameter of indi- 
vidual channels of the MCP of approximately 12 /xm. Since the gain is run at its 
maximum value cross talk between adjacent channels in the transition between 
the first and second stage is to be expected. This results in the requirement that 
two incoming photons can only be resolved when they are separated at the input 
of the MCP by at least two channels, i.e. by 36 /xm in our case. With the mag- 
nification of the optical system of 7.5 this translates into a minimum separation 
of two ions to be resolved of 5 /xm, which is well below the separation of ions in 
the axial well of about 25 /xm expected in our experiment. 

7 Summary 

It is our contention that currently the ion trap proposal for realizing a practical 
quantum computer offers the best chance of long term success. This in no way is 
intended to trivialize research into the other proposals: in any of these schemes 
technological advances may at some stage lead to a breakthrough. In particular. 
Nuclear Magnetic Resonance does seem to be a relatively straightforward way in 
which to achieve systems containing a few qubits. However, of the technologies 
which have so far been used to demonstrate experimental logic gates, ion traps 
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seem to offer the least number of technological problems for scaling up to lO’s 
or even lOO’s of qubits. 

In this paper we have described in some detail the experiment we are cur- 
rently developing to investigate the feasibility of cold trapped ion quantum 
computation. We should emphasize that our intentions are at the moment ex- 
ploratory: we have chosen an ion on the basis of current laser technology, rather 
than on the basis of which ion which will give the best performance for the 
quantum computer. Other species of ion may well give better performance: In 
particular Beryllium ions do have the potential for a significantly lower error 
rate due to spontaneous emission, although it is also true that lighter ions may 
be more susceptible to heating. Other variations, such as the use of Raman 
transitions in place of single laser transitions, or the use of standing wave lasers 
need to be investigated. Our choice of Calcium will allow us to explore these 
issues. Furthermore, calculations suggest that it should be possible to trap 20 
or more Calcium ions in a linear configuration and manipulate their quantum 
states by lasers on short enough time scales that many quantum logic operations 
may be performed before coherence is lost. Only by experiment can the theo- 
retical estimates of performance be confirmed rCTTTI . Until all of the sources of 
experimental error in real devices are thoroughly investigated, it will be impos- 
sible to determine what ion and addressing scheme enables one to build the best 
quantum computer or, indeed, whether it is possible to build a useful quantum 
computer with cold trap ions at all. 
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Abstract. A new microscopic trap, the elliptical ion trap, can coniine 
more than 10® ions with minimal micromotion, due to previously unrec- 
ognized properties of one-dimensional Coulomb crystals. Arrays of ellip- 
tical ion traps provide a physical model of a parallel quantum computer, 
consisting of thousands of microscopic Cirac-Zoller processors, each con- 
taining approximately 100 ions, maintained in entangled states by quan- 
tum optical interconnects. Physical properties of a trap array containing 
4 X 10® qubits are derived. 



Cirac and Zoller have recently proposed a model of a quantum computer m 
based on a linear ion trap. A row of ions are confined along the z-axis of the 
trap and laser cooled to the ground vibrational state. The lowest center-of-mass 
(c.m.) vibrational phonon of the ion array couples to the internal atomic states 
(the quantum bits or qubits) via Doppler shifted laser excitation. This phonon 
is used as a quantum communication channel to transmit superposition states 
between the atoms and carry out quantum logic. The Cirac-Zoller model contains 
all the interactions required for an evaluation of Shor’s algorithm|3| and has 
been studied in sufficient detail P| that it is often treated as a paradigm for the 
practicality of quantum computing in general. Several groups are constructing 
small scale tests of this model. 

The two currently known quantum algorithms, Shor’s factorizing algorithm 
and Grover’s quantum database search|S|, both require large numbers of qubits 
to gain an advantage over classical methods. Factorization of a 400 bit number 
has been estimated ^ to require > 2000 ions in the Cirac-Zoller model without 
quantum error correction (QEC) and from 10^ to 10® ions with QEC jn|. Similarly, 
Grover’s algorithm searches N unordered memory elements in only '/N steps, 
and cannot compete with classical methods unless N is very large. However, 
experience with current linear ion traps is limited to « 10^ ions[Z| rather than 
the 10^ to 10® needed. 

The paper introduces a new trap, the elliptical ion trap, which overcomes 
some of the limitations of the linear ion trap. It has three important features. 
First, it acts like a linear trap for more than 10® ions, so that the c.m. phonon 
can be used for quantum logic as before. Second, the trap requires only a single 
flat electrode of microscopic dimensions so that trap arrays can be microfabri- 
cated at a density of > 10^ traps/cm^. Third, the size and geometry of the trap 
are consistent with previously proposed quantum optical inteconnects jblPIHllij . 
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since the trap is small enough to be enclosed by an optical cavity which can sat- 
isfy the ’’strong-coupling” condition of cavity quantum electrodynamics. The 
intention is that both the trap and the optical interconnects be microfabricated 
in large arrays, in analogy to conventional microprocessors. This model pictures 
a quantum computer as consisting of thousands of microscopic Cirac-Zoller pro- 
cessors, each containing approximately 100 ions, maintained in entangled states 
by quantum optical interconnects. Moreover, the trap array supports a parallel 
architecture in which the n bits of a large quantum word reside in n separate 
traps, as discussed in more detail below. 




(b) 



Fig. 1. An ideal elliptical ion trap, consisting of three elliptical hyperboloids. 
It generates the potential of Eq. 1 throughout the interior volume, since the 
electrodes lie on equipotential surfaces. The ellipticity e = ^o/s^o- In practice the 
simpler electrode of Fig. 4 suffices. 




The elliptical trap, shown in Fig. 1, is a generalized ion trap which includes 
both the Paul trap^^l and the linear trap^Hl as special cases. It is defined by 
the potential 



V{x,y,z) 



2 2 

(I4c + Vocos{f2t)){^ - 
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The axes are named in agreement with the linear trap convention rather than 
the Paul trap convention. This is the most general quadratic potential satisfying 
= 0, which requires \jx^ + \jz^ = l/j/o^ • Such a potential can be 
created by hyperboloidal electrodes which form ellipses in the x — z plane, where 
xo and zq are the minor and major axes of the ellipse at j/ = 0, while j/o is the 
distance to the endcap electrode at x = z = 0. Solution of Newton’s law for the 
above potential leads to three Mathieu equations II 41151 for the trapped ion’s 
coordinates u = x,y,z 



(Pu 

dr 



2 + ttu [au - 2 qu cos(2r)] u = 0 



(2) 



where = 1 for u = x,z and = —1 for u = y, the normalized time r = f2tj2, 
the Mathieu parameter (a dimensionless trap strength) is 



4eVj] 

mQ'^up ’ 



( 3 ) 



m is the mass of the particle, 17/27T the rf drive frequency, and uq = xo,yoi-Zo- 
An identical equation holds for au with Vb replaced by 2Vdc- = 0 implies 
the relation qx + qz = qy and Oa, + Uz = Uy Define the ellipticity e = zqIxq to be 
ratio of the major to minor axes of the ellipse which yields qz = qy / {P + 1) and 
dx = dy^ I [P' + !)• In the low q limit {q < .2) where most traps are operated[E| 
the trap secular frequencies iOu — > du^l2^2 and therefore obey 



iOz = OJ. 



y ,2 



+ 1 



and 



^^2 + 1 - 



( 4 ) 



( 5 ) 



The Paul trap is a special case of e = 1 where ojx = oJz = tOyj2, while the linear 
trap is a limiting case of e ^ oo where Wz — i- 0 and ojx — ^ Although it is 

well-known that the linear trap is a limiting case of a stretched circular traolT^. 
and although the general form of Eq. 1 was discussed by Paul^J many years 
ago, the regime 1 < e < oo has not previously been considered in detail. 

To play the role of a linear trap in the Cirac-Zoller model, an elliptical trap 
must first confine the ions on the z-axis so that the c.m. phonon can act as a 
quantum communications channel and second have sufficiently small micromo- 
tion so that the resulting Doppler shifts do not decouple the ions from the lasers. 
Schiffer |Tbf1 7j . Diibin |TRl I NIJ . and others have shown through analytic and nu- 
merical studies of ion crystals in asymmetric harmonic potentials that the ions 
will stay on the z-axis providing 



mLoi 



( 6 ) 



where d is the ion-ion separation at the center of the trap and a = 7C(3)/2 = 
4.207, where C, is the Riemann (^-function. For d < do the ions undergo what 
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is called the ’’zig-zag transition”, where alternate ions move off the z-axis in 
opposite directions. Note that Eq. 6 does not depend directly on the axial trap 
strength coz, but only on the transverse trap strength Ux- No exact analytic 
expression for d exists for all N but several accurate approximations have been 
derived. Equating Steane’s 0 result d = 2.0soN~^-^'^ to do in Eq. 6, where the 
scale length Sq = yields the relation 

N{e) = (7) 

which determines the capacity of a trap of given ellipticity e to hold a number 
N{e) of ions on the z-axis. See Fig. 2. 




1 2 5 10 20 



8 

Fig. 2. A plot of Eq. 7, showing N{e), the maximum number of ions which can be 
confined on the z-axis before the zig-zag transition, as a function of the ellipticity 
e. Note that N{e) is independent of all other trap parameters. 



For N > N{e) the ions assume the zig-zag structure while for N < N{e) the 
ions are farther apart than the minimum value do- Note that Eq. 7 is independent 
of all trap parameters except the ellipticity. 

The second requirement is that the elliptical trap confine large crystals with 
minimal micromotion, that is, less than A/27 t, where A is the wavelength of the 
laser light. It has been assumed previously that only linear ion traps, which have 
no r.f. field along the z-axis, can provide small enough micromotion for quantum 
computation. However, a detailed calculation of micromotion amplitudes shows 
that a properly designed elliptical trap can limit micromotion oscillations to a 
few tens of nm, even for large ion crystals containing more than 1000 ions and 
even though r.f. fields are used for all confinement. Micromotion [EED] is the 
oscillatory motion of the ions at the rf drive frequency f2 and in the low q limit 
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is given by Hz{t) = zqzCos{fH) /2 where z is the time-averaged (secular) position 
of the ion. The largest micromotion occurs at the ends where z = /z/2, where Iz is 
the total length of the ion chain. The theories of Dubin and Shiffer assume a static 
harmonic potential (the pseudopotential approximation) which yields accurate 
values of Iz but gives no information about micromotion. These results may be 
summarized by the approximation = d{N — l)i °53 -^^hich agrees within 10% 
with numerical results The micromotion may then be estimated using 

Steane’s approximation for d yielding fiz{N) = qzSo{N — which 

approximates qzSQ'/N /2 as N —> oo. However, a much lower value of can be 

attained by matching the ellipticity to N. Consider a series of traps which are 
identical except for their ellipticity and assume that each trap contains N{e) ions 
in accordance with Eq. 7. This minimizes micromotion for a given N by using 
the weakest axial confinement which will keep the ions on the z-axis. Using Eq. 
4,5,6, and 7 yields 



.. .. ^o^ 1.985(A^- l)i°53 

/iz(fV) - MU2)^ 5ggj,^ .733AT-854)1/3 



(8) 



where /iz(2) is the micromotion of 2 ions in a trap with e = 1. In the large N 
{N > 20) limit ^z(fV) ^ 2.20iV-^°^z(2), that is, it depends approximately on 
the fifth root of N. This weak dependence is shown in Fig. 3 and permits trap 
parameters to be chosen so that micromotion is of little significance throughout 
the range N=2 to 1000, as shown in an example below. 




1 5 10 50 100 500 1000 



N 



Fig. 3. The maximum micromotion amplitude fXz in a linear crystal of N ions, 
Eq. 9. Note the weak dependence on N, which approximates at large N, 
due to previously unrecognized properties of 1-dimensional Coulomb crystals. 
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This result is unexpected since in three dimensions the micromotion ampli- 
tude increases approximately as due to the constant ion density [T!^. Why 

does one-dimensional confinement result in a weaker power law of The 

answer is that in one dimension the density increases with N, due to more effec- 
tive cancellation of the Coulomb repulsion of neighboring ions. A related result 
is that traps filled according to Eq. 7 all have the same value of d = do, despite 
the reduction of the axial trap force with increasing e. 

An elliptical trap can be constructed from single flat electrode, for example, 
an elliptical aperture in a conducting surface as shown in Fig. 4, which is straight- 
forward to microfabricate in arrays by photolithography. This is in contrast to a 




Fig. 4. An ” aperture trap” consisting of an elliptical hole in a conducting surface 
which generates the potential of Eq. 1 near the z-axis. For the case considered 
in the text trap arrays can be generated a density of > 100 traps/cm^. 



linear trap, which consists of a three dimensional array of four segmented rods [ 7 ]. 
The elliptical aperture generates a potential which reduces to Eq. 1 on the z- 
axis, where the ion crystal resides. The theory of such ’’aperture traps” has been 
discussed in detail[22j for the circular case and they have already been used in 
quantum optics |231 and quantum logic experiments |25 • In th© circular case the 
trap parameters can be derived as coefficients in the Legendre expansion of the 
potential[22|, but for e > 1 the corresponding power series contains ellipsoidal 
harmonics, which are rarely used. A three-dimensional numerical routine has 
therefore been used to estimate the three relevant parameters of the potential: 
the efficiency, the ellipticity, and the anharmonic terms. Potentials have been 
computed for apertures with ellipticities of 1,3,5, and 10, corresponding to N{e) 
of 2, 18, 60, and 300 in Eq. 7. The efficiency is defined as E = VojVt, where Vt is 
the potential applied to the conducting surface. The routine yields .05 < E < .10 
depending on the thickness of the electrodes. The ellipticity of the potential is 
found to be equal to the ellipticity of the aperture within the accuracy of 20% 
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and the anharmonic terms were found to be comparable to the circular ca,se|22|. 
Spefically for a 100 fim by 300/im aperture, the z-axis fourth order (anharmonic) 
term was less than 1% of the second order term along a central lOO/rm region of 
the z-axis. It is not apparent from numerical work whether an elliptical aperture 
produces the most accurate approximation to Eq. 1, since simpler apertures such 
as rectangles produce similar fields. 

The practicality of elliptical traps can be demonstrated by considering an 
example based on recent experiments. Consider first an e = 1 trap of 100 p,m 
radius, similar to ref. [23E31- To separately excite each ion for quantum logic the 
ion-ion spacing dp niust be greater than A. Choosing dp = 4 /rm yields an axial 
oscillation frequency « 1 MHz at an applied potential Vt=1000 V rms 

(efficiency E=.08) for the ion Ba 137, currently under study. The Mathieu pa- 
rameter ^2 = 2'\/2wz/‘!^ = 0.01 for i7/27r = 282 MHz. This yields a micromotion 
[iz = 10 nm for a two ion crystal. Now assume that for quantum computing we 
wish to confine 64 ions. Eq. 7 yields an optimum ellipticity e = 5.2 which de- 
termines (jz = 7.1 X 10“^. The crystal length Zz/2 « 32do = 128 microns, which 
gives /iz = 45 nm at the ends of the crystal and proportionately less near the 
center. The Doppler sidebands will therefore have an intensity of (/3) < -06, 
where /3 = 27rp.z/A. A larger /iz has been used in a recent experiment which ob- 
served interference and superradiance of two trapped ions|23|. The effect of such 
small micromotion amplitudes on quantum computing is therefore expected to 
be minimal. 



The above 64-qubit trap occupies an area of < 2 x 10“^ cm^ and may be 
placed in an array at a density of ~ 100 traps/cm^, yielding a bit density of 6400 
qubits/cm^. A 30 cm diameter disk could therefore support 64,000 traps holding 
4 X 10® qubits. Such a disk driven at 1000 V rms at 282 MHz will dissipate only 
3 watts, assuming a Q=5000. The laser power required to excite quantum logic 
in large arrays is also nominal, since recent experiments have saturated single 
ions with « 10“^ watts of resonant light. 



Quantum optical interconnects are essential for transferring quantum infor- 
mation between separate traps in an array. Recent theoretical work has proposed 
a variety of techniques [8I0I10I1'I) which transfer a quantum state between two 
distant atoms. These methods typically assume the ’’stong coupling” condition 
of cavity quantum electrodynamics > which requires that the cavity mode vol- 
ume Vm obey Vm < Vr, where Vr = <JQc/r is the atomic radiative volume. 



iTo = 3A^/27 t, r is the atom’s spontaneous emission decay rate, and c is the 
speed of light. Since Vr « (lOO/rm)® for ions of interest, cavity lengths must be 
less than one mm. This is consistent with the elliptical aperture traps of Fig. 
4 since their two-dimensional structure permits dielectric mirrors to be brought 
within several hundred microns of the trap center. It would of course be desirable 
to also microfabricate the cavity mirrors in arrays, and one can imagine a three- 
disk ’’sandwich” structure in which the central disk contains the trap arrays and 
two outer integrated optical disks contain mirror arrays for the quantum optical 
interconnects. 
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This work was motivated by the issue of parallel architecture in quantum 
computation, specifically whether the very large number of gate operations (10^^) 
required by the modular exponentiation^ in Shor’s algorithm can be attributed 
to the serial architecture of the Cirac-Zoller model. Note that it is important to 
distinguish between quantum parallelism (entanglement) as defined by Deutsch, 
which is common to all quantum computers, and architectural parallelism, which 
is the simultaneous use of multiple quantum channels. The Cirac-Zoller model 
is serial since it relies on a single quantum channel, the c.m. phonon, for all 
communications. Preskill et. al. evaluate the modular exponentiation as series 
of repeated multiplications, which are in turn formed from repeated additions, 
again formed from repeated single-bit operations. This choice is constrained by 
the serial architecture. Can parallel algorithms evaluate the modular exponenti- 
ation with far fewer gates, thereby making decoherence less of an issue? Parallel 
quantum logic can also increase the execution speed (a limitation of the Cirac- 
Zoller model), in the classical case typically by a factor of nlogn , and is needed 
as well for quantum error correction jS]. Arrays of elliptical ion traps provide both 
a model system for the analysis of such issues and an avenue for experimental 
work. An elliptical trap is under construction in our laboratory, for use with 
laser-cooled Ba 137 ions. 

I have benefited from many helpful comments on the manuscript from C. 
Kurtsiefer and D. DiVincenzo. 
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Abstract. A Quantum Computer is a new type of computer which can 
solve problems such as factoring and database search very efficiently. 
The usefulness of a quantum computer is limited by the effect of two 
different types of errors, decoherence and inaccuracies. In this paper we 
show the results of simulations of a quantum computer which consider 
both decoherence and inaccuracies. We simulate circuits which factor 
the numbers 15, 21, 35, and 57 as well as circuits which use database 
search to solve the circuit satisfaction problem. Our simulations show 
that the error rate per gate is on the order of 10“® for a trapped ion 
quantum computer whose noise is kept below 7 t/ 4096 per gate and with 
a decoherence rate of 10~®. This is an important bound because previous 
studies have shown that a quantum computer can factor more efficiently 
than a classical computer if the error rate is of order 10“®. 



Keywords: Quantum Simulation, Ion Trap, Factoring, Database Search 

1 Introduction 

A quantum computer consists of atomic particles which obey the laws of quan- 
tum mechanics |T’uHoH!t] |Lloy95| . The complexity of a quantum system is expo- 
nential with respect to the number of particles. Performing computation using 
these quantum particles results in an exponential amount of calculation in a 
polynomial amount of space and time |Feyn85| [DeutSh] . This quantum paral- 
lelism is only applicable in a limited domain. Prime factorization is one such 
problem which can make effective use of quantum parallelism |ShorTP] . This is 
an important problem because the security of the RSA public-key cryptosystem 
relies on the fact that prime factorization is computationally difficiilt [KiSA78) . 

Errors limit the effectiveness of any physical realization of a quantum com- 
puter. A quantum computer is subject to two different types of errors, decoher- 
ence and inaccuracies. Decoherence occurs when a quantum computer interacts 
with the environment. This interaction destroys the quantum parallelism by 
turning a quantum calculation into a classical one. The other type of error, in- 
accuracies in the implementation of gate operations, accumulates over time and 
destroys the results of the calculation. 

In this paper we show results of simulations of a quantum computer which 
is subject to both decoherence and inaccuracies. These simulations assume the 
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trapped ion model of a quantum computer proposed by Cirac and Zoller [niZo9,'T| . 
We study Shor’s factorization algorithm by simulating circuits which factor the 
numbers 15, 21, 35, and 57 |Shor93) . We also simulate Grover’s database search 
algorithm with a circuit which solves the circuit satisfaction nroblem | |( lrov96| . 
The rest of this section gives a brief overview of quantum computers. 



1.1 Qubits and Quantum Superposition 

The basic unit of storage in a Quantum Computer is the qubit. A qubit is like 
a classical bit in that it can be in two states, zero or one. The qubit differs 
from the classical bit in that, because of the properties of quantum mechanics, 
it can be in both these states simultaneously (EESSEl- A qubit which contains 
both the zero and one values is said to be in the superposition of the zero 
and one states. The superposition state persists until we perform an external 
measurement. This measurement operation forces the state to one of the two 
values. Because the measurement determines without doubt the value of the 
qubit, we must describe the possible states which exist before the measurement 
in terms of their probability of occurrence. These qubit probabilities must always 
sum to one because they represent all possible values for the qubit. 

The quantum simulator represents the qubits of the computer using a com- 
plex vector space. Each state in the vector represents one of the possible values 
for the qubits. The bit values of a state are encoded as the index of that state 
in the vector. The simulator represents each encoded bit string with a non zero 
amplitude in the state vector. The probability of each state is defined as the 
square of this complex amDlitude fEeLSti^ . For a register with M qubits, the 
simulator uses a vector space of dimension 2^ . 



1.2 Quantum Transformations and Logic Gates 

A quantum computation is a sequence of transformations performed on the 
qubits contained in quantum registers |Eeyn85| | Ha, He9^ . A transformation takes 
an input quantum state and produces a modified output quantum state. Typ- 
ically we define transformations at the gate level, i.e. transformations which 
perform logic functions. The simulator performs each transformation by multi- 
plying the 2^ dimensional vector by a 2^ x 2^ transformation matrix. 

The basic gate used in quantum computation is the controlled-not, i.e. exclu- 
sive or gate. The controlled-not gate is a two bit operation between a control bit 
and a resultant bit. The operation of the gate leaves the control bit unchanged, 
but conditionally flips the resultant bit based on the value of the control bit. 

1.3 The Ion Trap Quantum Computer 

The ion trap quantum computer as proposed by Cirac and Zoller is one of the 
most promising schemes for the experimental realization of a quantum com- 
puter mm- Several experiments have demonstrated simple quantum gates 
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mmm- Laser pulses directed at the ions in the trap cause trans- 
formations to their internal state. A common phonon vibration mode is used to 
communicate between the ions in the trap. A controlled-not gate is a sequence 
of laser pulses. We use the ion trap quantum computer as the model for our 
quantum simulator. 



Qubits in the Ion Trap Quantum Computer. Qubits are represented using 
the internal energy states of the ions in the trap. The ion trap represents a logic 
zero with the ground state of an ion, and a logic one with a higher energy 
state. The ion trap quantum computer also requires a third state which it uses 
to implement the controlled-not gate. In this paper we use a simplified model 
which, instead of using a third state for each qubit, uses a single third state which 
is shared amongst all the qubits. This simplified model reduces the simulation 
complexity exponentially without an appreciable loss of accuracv pbPe97a,| . 



Transformations in the Ion Trap. An operation in the ion trap quantum 
computer is a sequence of laser pulses. Each laser pulse is defined by one of the 
transformation matrices shown in equation Q ^ corresponds to the duration of 
the laser pulse and (f) corresponds to the phase. A two bit controlled-not gate is 
a sequence of five laser pulses, two V and three U transformations. A single bit 
not gate can be implemented with three laser pulses and the three bit controlled- 
controlled-not gate requires seven laser pulses. 



U = 



10 0 0 

0 cos I sin I 0 

0— ie®‘^sin| cos | 0 

0 0 0 1 



V = 



cos I —ie sin | 
— sin I cos | 



( 1 ) 



2 Simulating a Quantum Computer 

Our quantum computer simulator simulates circuits at the gate level. The simu- 
lator implements one, two and three bit controlled-not gates as well as rotation 
gates. The simulator implements each gate as a sequence of laser pulses, and 
represents the entire vector space throughout the simulation. 



2.1 Operational Errors and Decoherence 

The simulator models inaccuracies by adding a small deviation to the two angles 
of rotation 0 and (f>. Each operational error angle is drawn from a gaussian 
distribution with a parametrized mean (/r) and standard deviation (u). Errors 
with non zero /r, called mean errors, correspond to systematic calibration errors, 
and errors with non zero cr, referred to as standard deviation errors, correspond 
to noise in the laser apparatus. 
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Because the phonon mode is coupled to all the qubits in the computer, it is 
the largest source of decoherence| |MoMe05| . For this reason we only model the 
phonon decoherence and not the decoherence of the individual qubits. 

We model the decoherence of the phonon mode by performing an additional 
operation after each laser pulse. Equation 0 shows this transformation which 
has the effect of decaying the amplitude of the states in the phonon state. This 
decay transformation is based on the quantum jump aDDroach |( larmh,^ . The 
decay parameter (dec) remains constant throughout the entire simulation. 

|'0 > |0 , IV" > |0 >p 

This method of modeling decoherence implicitly models spontaneous emission. 
Because the state is never renormalized, the total norm at each step represents 
the probability that the calculation survives up to that point without a sponta- 
neous emission occurring. An alternative method for modeling decoherence is to 
renormalize the state at each step and then, based on a probability of emission, 
cause emissions at different points in the calculation. This method has the dis- 
advantage that, because we cause emissions at random points in the calculation, 
we must run multiple simulations each with different initial random seeds to 
average out any bias caused by the random number generator. We have shown 
however that both methods for modeling decoherence give essentially the same 
results |D bDef)7a) . 

Because the simulator applies the decoherence transformation once per laser 
pulse, the parameter dec has units of (decoherence/laser pulse). To convert these 
units to decoherence per unit time we must consider the switching time of the 
laser. The dec parameter is simply the switching speed divided by the decoher- 
ence time. Recent experiments show switching speeds of 20kHz for a controlled- 
not gate, i.e. four tt pulses, and a decoherence rate of a few kHz |MolVief)!Tf . This 
corresponds to a decoherence parameter value between 10“^ and 10“^. 



2.2 Quantum Circuits 

Much of the current interest in quantum computation is due to the discovery 
of an efficient algorithm by Peter Shor to factor numbers jShorOd] . By putting 
the qubit register A in the superposition of all values and calculating the func- 
tion f(A) = X^modN^ a quantum computer calculates all the values of f(A) 
simultaneously. Where N is the number to be factored, and X is a randomly 
selected number which is relatively prime to N. The quantum factoring circuit 
also contains operations to create the superposition state at the beginning of the 
circuit, and extract the period at the end of the circuit. The circuit to calculate 
f(A) can be performed in O(L^) time using repeated squaring |Desp96| , i.e. a 
sequence of multiplications performed modulo N . 

Grover’s database search algorithm searches for a key, from a set of match- 
ing keys, in an unsorted d ata,ba,se |( 1 rov9^ . The keys are defined by a function 
which can be evaluated in unit time. After evaluating this function a diffusion 
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transformation is performed which amplifies the probability in the states with 
matching keys. Grover shows that after performing 0{^/{N)) evaluation steps 
and diffusion transformations the probability of measuring a matching key is 
greater than 1/2. 

3 Simulation Results 

In this section we study how decoherence and operational errors degrade the 
fidelity of the factoring and database search algorithms. 

The fidelity, as defined by fidelity = || < (plif > p, measures how close a 
state with error in it is to the correct result. The fidelity is defined as the inner 
product between the simulation with errors (V’) and the correct result ((/?). 

Tabled shows all the benchmarks used in our simulation studies. To show 
the complexity of simulating these benchmarks we show the simulation time for 
each. This time assumes a single 300 MHz processor and the simulations include 
only operational error. All the factoring benchmarks were run using a parallel 
version of the simulator |( ) bPeOS) . We have run simulations on as many as 256 
processors, and the simulator achieves near linear speedup. 



Table 1. Benchmarks used in simulation studies 



Benchmark 


Number of 
qubits 


Number of 
laser pulses 


Description 


Simulation 
time (secs) 


grover 


13 


1,838 


Circuit SAT using the Grover 
database search algorithm 


10 


mult 


16 


8,854 


One modulo multiply step 
from the factor- 15 problem 


282 


factorl5 


18 


70,793 


Factor- 15 problem using 
3 qubits for A 


10,465 


factor21 


24 


69,884 


Factor-21 problem using 
6 qubits for A 


272,276 


factor35 


27 


99,387 


Factor-35 problem using 
6 qubits for A 


3,083,520 


factors? 


27 


97,939 


Factor-57 problem using 
6 qubits for A 


3,067,853 



Because of round off error, in the factoring algorithm, the choice of the num- 
ber of bits to use in the A register affects the probability seen after performing 
the FFT |Shor94j . Shor suggest using 2L+1 bits for an L bit factorization, but 
for the numbers used in our studies we can use less. For the factorl5 problem the 
period is a power of two, i.e. four, and therefore there is no round off error. For 
the numbers 21, 35 and 57, the probability given by the FFT does not increase 
for more than six bits. Also we are mainly concerned with observing the fidelity 
for these circuits, and the fidelity is always calculated before the FFT. 
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3.1 Operational Errors 

In this section we consider operational errors without any decoherence. We first 
investigate the significance of errors in the angle </), by varying the amount of 
error in the angles 9 and (j) separately. In all other simulations we vary the angles 
9 and cj) together. 

To average the random bias out of simulations with non zero a we run mul- 
tiple simulations each with different initial random seeds. To get a single sim- 
ulation point we run at least four simulations, and in many cases we run more 
to establish upper and lower confidence intervals for the average of the ending 
fidelities [IHilVIoisn] . For simulations which include errors in the angle 9 we run 
enough simulations so that the average of the fidelities is within 0.02 of the ac- 
tual mean of the distribution with an upper and lower confidence of 95%. There 
is more variability in the fidelity for simulations which consider only cj) errors, so 
for these we obtain the 95% confidence intervals to within 0.03 of the mean. 



Significance of Errors in the Angle 4>. Fig. Q] shows how operational errors 
degrade the fidelity for the factoiT5 benchmark. In Fig. ^a) we vary the mean 
and standard deviation and introduce both 9 and (j) errors. In Fig. mb) we only 
introduce 9 errors. Comparing the two graphs shows that the combination of 9 
and (j) errors produces a lower fidelity than 9 errors alone. 



(a) Factorl5, 0 and ([) errors, No Decoherence (b) Factorl5,6 Errors only, No Decoherence 





Fig. 1. Inaccuracies for factoiT5 (a) 9 and </> errors (b) 9 errors (c) 4> errors (d) 
9 and (j) errors, 9 errors, (j) errors 
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Fig.CKc) shows the effect of only </> errors. As the graph shows noise degrades 
the fidelity, but adding a constant amount of error has no effect. Fixed magnitude 
(j) errors have no effect because the laser transformations are always performed 
in pairs, and an error in the second transformation cancels an error in the first 
transformation. In Fig. m) we compare the effect of 6 and 4> errors alone and 
their combined effect. The highest degradation occurs when considering both 9 
and (j) errors, and 9 errors produce a more significant effect than (j) errors. 

Operational Errors for the Grover Benchmark. Fig. [3 shows mean and 
standard deviation operational errors for the grover benchmark. The figures show 
the probability of finding a correct key for twelve iterations of the algorithm. 
When running the database search it is important to stop after the correct iter- 
ation, because the probability decreases if we run too many iterations [Ijoljr}??)) . 
For our case the highest probability occurs after the eighth iteration. 



(a) Grover, |i = 0, No Decoherence (b) Grover, G = 0, No Decoherence 





Fig. 2. Inaccuracies for the grover benchmark, (a) a Errors, (b) /r errors 



Fig. 0(a) shows that the peak probability is above 0.5 for a errors as great 
as tt/ 128, and for an error rate of 7 t/ 64 the peak probability is 0.2. Also for 
cr errors the peak probability occurs after the same iteration for all errors less 
than 7 t/ 128. However /r errors, as shown in Fig. 0(b), shift the peak so that it 
occurs at an earlier iteration. The peaks are higher for /r errors than they are 
for tr errors with the same level of error. However, since we can only perform a 
single measurement, the shift in the peak values causes a further reduction in 
the probability if we measure after the eighth iteration. 

Fidelity at Intermediate Points for the Factoring Benchmarks. Fig. 0 
shows the fidelity at intermediate points in the calculation for the factor21, and 
factors? benchmarks. 

Standard deviation errors produce a more significant effect than fj, errors of 
the same magnitude. This is due to a cancellation effect for mean errors which 
is very similar to the cancellation effect exhibited by cf) errors jcThPeDTa] . This 
cancellation effect occurs because of the nature of reversible computation and 
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(a) Factor21, a = 0, No Decoherence (b) Factor21, = 0, No Decoherence 




Fig. 3. Fidelity as a function of the number of gates for factor21 and factors? 



causes the fidelity to go through periods where it increases. Because we perform 
operations in pairs, where the two operations are inverses of each other, an 
error in one operation may reverse the error from an earlier operation. Standard 
deviation errors also exhibit this effect but it is reduced because we average the 
results from multiple simulations. 



3.2 Decoherence Errors 

Fig .0shows how the fidelity of a computation decreases over time in the presence 
of decoherence. Fig. 2|shows the fidelity at intermediate points in the calculation 
for the four factoring benchmarks. For small amounts of decoherence, i.e. 10“® or 
less, the fidelity is not adversely affected. A decoherence rate of 10“^ results in a 
steady decrease in the fidelity over the course of the computation, and for rates 
even higher the fidelity drops off very quickly. As the figure shows, decoherence 
has a similar effect on all of the benchmarks. 



3.3 The Correlation Between Decoherence and Operational Errors 

Both decoherence and operational error cause a degradation of the fidelity in a 
quantum computation. Decoherence degrades the fidelity through the decay of 
the phonon state, and operational errors result in the accumulation of amplitude 
in unwanted states. The combined effect of these two factors is a degradation 
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(a) Factor 15, Decoherence, No Operational Errors 




(b) Factor21, Decoherence, No Operational Errors 



(c) Factor35, Decoherence, No Operational Errors 





Fig. 4. Fidelity as a function of the number of gates for decoherence in the 
factorl5, factor21, factorSS and factor57 benchmarks 



which is worse than either factor considered alone. We can represent the com- 
bined effect as: Fdec,op = Fdec • Fop + f2{Fdec, Fop) Where Fdec and Fop are the 
fidelities of simulations for decoherence and operational error considered sepa- 
rately, and f2{Fdec, Fop) is the correlation between the two types of error. As 
Table 0 shows the correlation is very low. We calculated the correlation by run- 
ning simulations which considered decoherence and operational error together. 
For all the benchmarks the maximum correlation is at most 1.14 x 10“^. This 
result means that we can simulate decoherence and operational errors separately, 
and combine the results to obtain their collective effect on a calculation. 

The complexity of simulating decoherence alone is much lower than the com- 
plexity of simulating operational errors. This is because the decay transformation 
does not have any off diagonal terms, and therefore it does not introduce any 
new error states. The simulator only needs to represent enough states to repre- 
sent the superposition state. Instead of using the index of a state to represent 
its bit value, the simulator now keeps an extra field for each state which holds 
the current value of the bit string for that state. 

Using this new method for modeling decoherence, the simulator only needs 
to allocate 0(2^) states to simulate the factorization of an L bit number. To 
represent all the qubits for this problem, the simulator would need to allocate 
0(2'^^) states. This reduces the memory requirements and simulation complexity 
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Table 2. Correlation (17) between decoherence and operational errors 



Benchmark and Simulation Model 


Maximum 17 


Average 17 


mult, o — tt/1024 - tt/64 


5.76 X 10“® 


3.63 X 10"® 


mult, <j = Q, ji — tt/1024 - tt/64 


9.26 X 10~® 


5.51 X 10""^ 


factorlS, a — tt/1024, y = 0 


4.15 X 10“® 


3.96 X lO"'^ 


factorl5, a = 0, y = tt/1024 


1.14 X 10~^ 


8.76 X lO"'^ 


grover, y=0, cr = tt/1024 - tt/128 


1.78 X 10“® 


1.02 X lO"'^ 


grover, cr=0, y = tt/1024 - tt/128 


2.67 X 10~® 


2.36 X 10"'^ 



by a factor of 0(2^^). For example the simulation of the factorl5 circuit requires 
only 1/4096 the amount of time as before. 

3.4 The Error Rate per Gate 

As our simulation results show, a modest amount of error destroys even a rel- 
atively small calculation. If quantum computers are to be useful, we must be 
able to perform calculations which are even larger than the ones considered 
here. These larger calculations will therefore require the use of quantum error 
correcting codes ^tea, 96) . Several recent studies have shown that, by using fault 
tolerant techniques, if the error of an individual gate is low enough we can per- 
form a useful quantum calculation of indefinite length [Pres96j [Rnl j/96] . This 
accuracy threshold is expressed in terms of the probability of error per gate. We 
can use the results of our simulation studies to show how this error probability 
relates to decoherence and inaccuracies. 

TableElshows the error rate per gate considering decoherence and operational 
errors for the factor57 benchmark. Error rates for the other factoring benchmarks 
as well as error rates for the combination of decoherence and operational errors 
are given in IOhhe97hl . The error rate is calculated as (1 - Fidelity) /Number of 
Gates. To get the error rate for a particular amount of error, we calculate the 
error rate after every tenth gate in a computation and take the average. A gate 
is either a one, two or three bit controlled-not gate or a single bit rotation. It 
takes five laser pulses on average to implement each gate. 

To perform a computation of arbitrary length the error rate must be about 
10“® for one and two bit gates and 10“^ for three bit gates [Pres96]. An error 
rate of 10“^ corresponds to operational errors with a = tt/ 2048 and decoherence 
of 10-^ Table 0 shows that we can tolerate an even higher level of constant 
magnitude errors. The error rate is 10“® for operational errors with /r = tt/1024. 

To perform a quantum factorization which is more efficient than a classi- 
cal one the error threshold is even tighter, roughly 10“®. This corresponds to 
operational errors with a < tt/ 4096 and a decoherence rate of I0“®. 

Using the error rate of our factoring circuits to predict the error rate for 
error correction circuits assumes that these two types of circuits behave in a 
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Table 3. Average error rate per gate for the factor57 benchmark 



Decoherence 


Operational Errors, p = 0 


Operational Errors, a = 0| 


dec 


Error Rate 


(7 


Error Rate 


M 


Error Rate 


10-'^ 


9.1 X 10"® 


tt/8192 


6.6 X 10"'^ 


tt/4096 


7.9 X 10"'^ 


10"® 


9.1 X 10"'^ 


tt/4096 


2.6 X 10"® 


tt/2048 


3.1 X 10"® 


10"® 


8.8 X 10"® 


tt/2048 


1.0 X 10"® 


tt/1024 


1.2 X 10“® 


10"^ 


6.5 X 10“® 


7t/1024 


3.6 X 10"® 


tt/512 


4.4 X 10“® 



similar manner. These circuits are similar because they are both built from the 
same types of elementary gates, i.e. controlled-not gates. Also the error rate, 
for a given amount of error, is very similar for all the factoring circuits that 
we considered. Lastly effects that we have seen such as error cancellation are 
a by-product of the fact that quantum circuits and gates are implemented in a 
reversible fashion. Because of the nature quantum mechanics quantum circuits 
will always be implemented in this way. 



4 Conclusion 

Quantum computation is a new type of computation which can achieve expo- 
nential parallelism. The feasibility of a quantum computer is threatened by two 
types of errors, decoherence and inaccuracies. In this paper we performed simu- 
lations of a quantum computer running Shor’s factoring algorithm and Grover’s 
database search algorithm to access the feasibility of implementing a quantum 
computer. 

Our simulations show that random inaccuracies (noise) are more significant 
than fixed magnitude inaccuracies for the ion trap quantum computer. Also 
errors in the duration of the laser pulse are more significant than errors in the 
phase of the laser. For the problems considered in this paper we show that 
noise or constant magnitude inaccuracies of magnitude 7 t/ 512 or greater cause 
a significant impact on the fidelity of the calculation. 

Our simulations also show that a quantum computation can tolerate a deco- 
herence rate as high as 10“®. For the ion trap computer this corresponds roughly 
to a decoherence lifetime of 50 milliseconds and a switching speed of 500 nanosec- 
onds. We also show that inaccuracies and decoherence are uncorrelated and can 
be simulated separately. 

Our simulations relate the physical quantities of inaccuracies and decoherence 
to the probability of error per gate. An error rate per gate on the order of 
10“® corresponds to inaccuracies of less than 7 t/ 4096 per laser operation and a 
decoherence rate of 10“®. A quantum computer with this error rate, if it uses 
quantum error correcting codes, could factor a number more efficiently than a 
classical computer. This assumes that the quantum circuits used to implement 
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factoring with error correction codes behave in the same manner as the factoring 
circuits used in this paper. 
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Abstract. We propose an implementation of a quantum controlled- 
NOT gate on the basis of dipole-dipole interacting asymmetric quantum 
dots. Our implementation does not require application of an external 
electric field as the one proposed earlier [Barenco et. al, Phys. Rev. Lett., 
T4, 4083 (1995)]. Results of our numerical simulations show that owing 
to the dot asymmetricity, the coupling constant of the dipole-dipole in- 
teraction can be made as large as TiuJd ~ 50 meV while keeping the 
probability of the spontaneous emission low. This provides conditions 
for resolving different entangled quantum states experimentally. 
Keywords: quantum gate, asymmetric quantum dot, dipole interaction 



1 Introduction 



Development of quantum computation schemes has led to many proposals of 
potentially realizable quantum computers based on various physical systems. 
Among the most known are quantum computers that utilize laser trapped atoms 
nuclear magnetic resonance (NMR) systems [2, all-optical logic gates 0, and 
semiconductor nanostructures 0. Successful experimental demonstrations of one 
and two qubit computers were reported for laser trapped atom systems and NMR 
systems 0. The progress in experimental development of quantum logic gates 
on the basis of semiconductor nanostractures has been far short of these systems. 
This is primarily due to the difficulties in fabrication of high quality quantum 
dot arrays and the decoherence problem p] which is inherently more severe for 
solid state systems. Meanwhile, if these problems are overcome, the quantum 
computer based on semiconductor quantum dots offers an attractive alternative 
to other physical implementations due to its compactness, robustness, and the 
larger number of qubits which can be realized To add to these merits, 

the solid state implementation of a quantum computer avoids dealing with a 
statistical mixture of pure quantum states like in NMR realizations [2], or pitfalls 
related to manipulation of single ions in a trap. Another important advantage of 
the quantum dot implementation is that further development can make use of 
appropriate modification of well-developed conventional silicon based technology. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 46Q- tTbYi 1999. 

© Springer- Verlag Berlin Heidelberg 1999 



Implementation of Quantum Controlled-NOT Gates 461 



In this paper we propose a solid state implementation of a quantum controlled- 
NOT gate based on coupled asymmetrical quantum dots. These dots have asym- 
metric potential profiles along one of the dots’ axis such as the step quantum 
dot in Fig. 1. Other potential profiles which can be used for our structure were 
discussed in a different context in Refs. [m, m- The potential well asymmetry 
has the effect similar to that of applying an electric field but allows for more de- 
grees of freedom. This is a major advantage for the quantum logic gates. The rest 
of the paper is organized as follows. In section 2 we present an example struc- 
ture and the operational principle of the proposed solid-state implementation of 
quantum controlled-NOT gate; section 3 shows electron charge density distri- 
butions in asymmetric dots obtained by numerical simulation. Discussion and 
comparison of the performance of the proposed gate with other implementations 
are given in section 4. We present our conclusion in section 5. 



EXCITED STATE 


"GLOBAL" 


V 




AIGaAs 


GROUND STATE 




"LOCAL" 


0 


GaAs 

A 



Fig. 1. Potential profile of the quantum dot along the z-axis. The step height of 
the potential profile and its position can be tuned by a proper choice of materials 
and growth conditions. 



2 Structure of the Gate 

We consider two single-electron quantum dots separated by a distance R. The 
ground state and one of the excited states j of each dot form computational basis 
states |0 > and |1 > in much the same way as it was suggested in Ref. |^. The 
actual physical implementation of this gate may have more than one electron 
per dot. However, we restrict our consideration to one electron per dot for the 
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sake of simplicity. The first quantum dot with the resonant energy TluJc acts as 
the control qubit while the second dot with the resonant energy fiujt acts as the 
target qubit. The gate is driven optically by application of a proper train of tt 
pulses P|. The schematic of the potential profile of the dot is given in Fig. 1. The 
dots with such asymmetric potential profile can be grown by self-assembly on 
top of some pre-patterned substrate. Current state-of-the-art technology allows 
for fabrication of such dots with the feature size of 50A- 300A with a 50A- 
200A separation between two dots ^U]. Since these dots are separated by air 
and not by the layer of some semiconductor, the electrical dipole-dipole coupling 
between them is stronger due to the lack of dielectric screening. It also allows us 
to assume hard- wall boundary conditions for the dots. 

Due to the asymmetry of the potential profile of the quantum dot, the charge 
distribution in the ground state and the excited states of each dot is uneven in the 
absence of an external field. The charge distribution can also be tuned and shifted 
in any direction by a proper modification of the potential profile. The difference 
in the charge distributions between two states is particularly pronounced when 
one of the states is so-called ’’local state” and the other one is ’’global state” HH, 
na. We call the state ’’local” if its wave function is almost entirely localized in 
the vicinity of some smaller area inside the dot. In Fig. 1, only the ground state 
is the local state. We intentionally did not specify which of the excited states we 
use as the basis state 1 1 > since for a particular potential profile and a dot size it 
may be better to utilize higher excited states rather than the first excited state. 
Owing to the dipole-dipole interaction, the resonant frequency for transitions 
between the excited and ground states in one dot depends on the electron state 
of the neighboring dot. The coupling constant of the dipole-dipole interaction 
can be written as ^ 



hujd 



dcdt 

AnereoR^ ’ 



( 1 ) 



where h is the Plank’s constant, Co and Cr are the vacuum and relative dielectric 
permittivities, respectively; dc and dt are dipole moments of the control and 
target dots, respectively. The resonant frequency for the target dot becomes 
cot±cod depending on the state of the control dot (|0 > or |1 >). As a result, a n 
pulse of frequency tUt + oJd causes the transition |0 >— > |1 > only provided that 
the control dot is in state |1 >. 

In order to be able to determine the state of the system experimentally, the 
coupling constant TiLOd has to be relatively large 



/\lu — ^t,c A T, (2) 

where F is the emission (absorption) line broadening due to interaction with 
phonons and impurities. The probability of spontaneous transitions has also to 
be small and controlled via a proper choice of the basic computational states 
and structural engineering in order to be able to perform many computations. 
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3 Numerical Simulation 



To determine the electron wave functions and resonant frequencies for each con- 
fined state, we applied the transfer matrix method US). The dipole due to the 
uneven charge distributions was then calculated as 

nL/2 

d= / W{z)z'I'*{z)dz, (3) 

J-L/2 

where L is the dot’s size along z-axis, and S'(z) is the component of the electron 
wave function envelope. Material parameters used in simulations correspond to 
GaAs. The electron effective mass is assumed to be 0.067mo, where ruo is the free 
electron mass. The relative permittivity of GaAs is taken to be 12.9. The charge 
density distributions for the first four confined states together with corresponding 
confined energies and dipole moments are shown in Fig. 2 and Fig. 3 for two 
different dot sizes. In both cases, the step (point A in Fig. 1) is located at the 
middle of the dot. The height of the potential step approximately corresponds 
to the heterojunction band offset between GaAs and Alo, 2 Gao,sAs. 



E=108.7656meV p=-11.1068e-A E=31 5.801 meV p=5.5677e-A 







Fig. 2. Charge density distribution for the first four confined states. Results are 
presented for a quantum dot of width = lOOA and a 150 meV high step. The 
legends on top of each figure give the values of the confined energy and electric 
dipole. 
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As one can see, when the dot size is rather small (lOOA), the dipoles due 
to step-well potentials are also small: p=ll.l e-A for the ground (local) state 
and p=5.6 e-A for the first excited (global) state. It will be shown further 
that these values of dipoles do not provide the couplings strong enough for 
the experimental resolution of different quantum states. For the 200A wide 
quantum dot, the dipoles are 38.8 e-A for the ground state, 22.8 e-A for the 
first excited state, and 27.4 e-A for the second excited state. Contrary to the 
previous case, the first excited state here is still a local state which means that 
its energy (E=142.3 meV) is below the potential step (E=150 meV). The high 
dipole moments and sufficient intersubband energy separations of the confined 
levels in the 200A wide dot make them attractive for the use as computational 
basis states. A further increase in the dot size decreases the subband separation 
too much and is not appropriate for our purposes. It will also be shown in the 
next section that the confined states of the asymmetric quantum dot are rather 
long-lived, which is important for computational basis states. 



E=38.829meV p=-38.7597e-A 





E=1 42.31 37meV p=-22.8403e-A 





Fig. 3. Charge density distributions for the first four confined states. Results 
are presented for a quantum dot of width = 200A. The legends on top of 
each figure give the values of the confined energy and electric dipole. 
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4 Discussion 

We have calculated the dipole-dipole interaction coupling constant for the case 
of two 200A wide quantum dots separated by a distance of lOOA. Fig. 4 shows 
dipole moments (upper panel) and the coupling constant (lower panel) as func- 
tions of the dot size along the ^-axis. The height of the potential step is 300 
meV. The coupling constant denoted by the solid line is plotted for the gate 
that utilizes the ground state and the first excited state as the computational 
basis states. The dashed line corresponds to the gate implemented on the basis 
of the ground state and second excited state. In the first case (ground state), the 
coupling constant grows monotonically while in the second case there are well 
resolved minima which correspond to the dot size and potential profile when the 
symmetry of the charge density distribution is partially restored. Thus, one has 
to control precisely the dot size and potential step height when designing the 
controlled-NOT gate in order to avoid ’’small-dipole” conditions. Rather large 
values of the coupling (as compared to the broadening) allows for experimental 
resolution of different quantum states. Moreover, the condition of Eq. (2) may 
be met even at room temperature. 

A large value of the coupling constant also reduces requirements for the tt 
pulse selectivity. The length of the pulse has to be in the limits specified by 
the condition ^ 

<T^ < Tcoh, (4) 

uJd 

where Tcoh is the quantum coherence time of the system which is limited by 
spontaneous emission time in our case. For the time allowed for computation 
(Tcoh), we want to send as many tt pulses as possible thus increasing the number 
of computations and prepared quantum states. Since it is difficult to increase 
upper limit of the unequility (4), we may try to decrease the lower limit. For 
the asymmetric dot implementation, the inverse of the coupling constant is on 
the order of 4 x 10“^'^ s. For comparison, the number obtained in Ref. ^ for 
the gate based on square potential quantum dots biased with an electric field is 
on the order of 10“^^ s. This means that in principle, a femtosecond laser can 
be used to drive our controlled-NOT gate provided that such laser exists in the 
working frequency range. 

In order to maintain coherence during the computation time and to make 
Tcoh as large as possible, one has to minimize the coupling of the gate to the 
environment. Particularly for our structure, we have to minimize the probability 
of spontaneous transitions between the states that serve as the computational 
basis. Our results indicate that the use of the asymmetric potential instead 
of biasing electric field may help solve this problem as well. To show this, we 
calculate the oscillator strength of the transitions using the definition 

f=^\ nz)Wr\^*{z)dz\\ (5) 

« J-L/2 

where lo is the transition frequency, u is the unit vector along the direction of 
the incident photon polarization, r is the real space radial vector. For simplicity. 
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Fig. 4. Upper panel: dipole moments vs. dot size. Solid line, dashed line, and 
dashed-dotted line correspond to the ground state, the first excited state and the 
second excited state, respectively. Lower panel: coupling constant of the dipole- 
dipole interaction vs. dot size. Solid line corresponds to the coupling between the 
ground state and the first excited state, dashed line corresponds to the coupling 
between the ground state and the second excited state. The dot separation is 
fixed at 100 A 



we assume that v — and, thus ur = z. The oscillator stregth of the transition 
between the ground and first excited states in a symmetric square potential well 
(120A wide) subjected to an electric field of 36 kV/cm is about 0.95. The same 
transition in a step potential structure has the value of / « 0.60. The transition 
strength between the ground and second excited state in our structure is 0.3 and 
can be further reduced down to 0.08 if the electric field of 36 kV/cm is applied. 
From the above arguments, we can conclude that the spontaneous relaxation 
time Ts(oc 1//) is bigger in our structure than that in corresponding symmetric 
dots biased with the electric field. This allows for improvement of the coherence 
time of the quantum gate by the proper choice of the potential. 
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5 Conclusions 

We propose a solid-state implementation of a quantum controlled-NOT gate 
based on coupled asymmetric quantum dots. The structure can be realized using 
the state-of-the-art technology. The results of our numerical simulation indicate 
that owing to the dot asymmetricity, the coupling constant of the dipole-dipole 
interaction can be made very large as compared to the transition line broadening 
and to the other previously proposed structures 0,0,0 We show that proba- 
bility of spontaneous emission can be reduced in our gate, thus making it easier 
to meet the time coherence requirements. Discussion of other issues related to 
physical implementation of quantum logic gates is also presented. Finally, we 
argue that our gate can be driven by a femtosecond laser if one is designed for 
an appropriate frequency range. 
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Abstract. This paper enstimates the stability of dipole-dipole interac- 
tion in quantum dot array, and proposes a novel solid state quantum 
CCN (controlled controlled not) gate having a block structure, which 
is effective to maintain quantum mechanical coherence and reduce both 
the bit error and the phase error. The spatiotemporal dynamics of quan- 
tum computation process involving the quantum entangled pure states 
is illustrated. 

key words: dipole-dipole interaetion, quantum dot array, solid block, quan- 
tum CCN gate, ensemble cancellation of errors, quantum entangled pure 
states 



1 Introduction 

We have been proposing solid state quantum gates, employing a coherent mode 
generated by the dipole-dipole interaction among ensemble of Frenkel type ex- 
citons, each of which being confined in a three dimensional quantum dot |p. 

Our treatment of the dipole-dipole interaction is different from previous ones 
especially in that we employed the parallel (analogy to ferromagnetic) 
dipole-dipole interaction to enhance the coherence, not only to directly execute 
any particular logic as in the case of ref. P| • In the next section (0 of this paper, 
the stability of this dipole-dipole interaction is estimated energetically. 

A pair of the block constitutes a realistic solid state quantum CN (controlled 
not) gate, having sufficient coherence due to the dipole-dipole interactions among 
the induced dipole moments, and also due to cancellation of phase fluctuation 
in the ensemble of quantum dots Then it is possible to construct universal 
quantum systems, employing this CN gates and few other kinds of simple gates. 
In this way, a quantum CCN (controlled controlled not) gate is proposed as an 
essential part of the universal system for quantum computation. 

The spatiotemporal dynamics of the quantum entangled pure states, which 
is the kernel of the quantum super-parallel computation, is illustrated for the 
proposed quantum CCN gate of logical block structure. The logical block may 
work as an artificial molecule having a gigantic dipole moment. 

C.P. Williams (Ed.): QCQC’98, LNCS 1509, pp. 4fi8- IT771 1999. 
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2 Energetics of the Dipole-Dipole Interaction in the 
Block 



2.1 Dipole-Dipole Energy 



The array of dipole moments p„ arranged at positions Rm impose a resultant 
Electric field E(n) on a site at R„ as given below. 



E(n) = 



Ittco 



E 



Rn Rn 



-3 



Pm ■ (Rn Rm) (Rn Rm) 
I Rn - Rm, 1^ 



( 2 . 1 ) 



in SI unit. Then the average energy Q of the total dipole-dipole interaction in a 
block per sigle quantum dot may be given as 



Q= j^EP" 



( 2 . 2 ) 



where N is the total number of quantum dots in the block. 

The dipole-dipole energy of eq. 112. 211 is plotted as a function of the separation 
of quantum dots in Fig.l, for the case of cubic array having identical dipole 
moments all aligned along z direction. The dipole moment is assumed to be, 
as an example, that of GaAs; p = | p | = ed = 3.5 x 10“^°e mC 0, with 
elementary electric charge e = 1.60219 x 10“^® C = 4.80321 x 10“^° esu. The 
plotting is done for three kinds of blocks, of which dimensions (x, y, z) are (1,1, 
lo), (11, 2) and (5, 3, 5). 

As is explained in detail elsewhere [IJ, including this dipole-dipole energy in 
the starting Hamiltonian, we derived a localized coherent mode below a con- 
ventional excitonic band, in a regime where the population difference is below 
a critical value, which is estimated to be for an example 0.87 (corresponding to 
94% excitation) in the case of 2.6nm spacing GaAs quantum dots. In this model, 
the confinement of the exciton is assumed to be so perfect, that no considerable 
overlapping of wave function and no tunneling among them is expected. This 
localized mode may be useful as the excited state of a quantum gate, because of 
its ferromagnetic type stability. 



2.2 Curie Temperature 

The field of eg. (12. Ill could be thought of as the molecular (or Weiss) field in 
ferromagnetic materials. For array of identical dipoles aligned along the z di- 
rection, the molecular field is given as | E(n) | = A p, and the Gurie- Weiss 
law X = t-t t>e reasonable for the susceptibility y with a Gurie constant 

C = Tc = CX, and the Boltzmann constant k, following the procedure of 
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Fig. 1. Dipole-dipole interaction energy per quantum dot as the function of 
the spacings of quantum dots, for different block dimensions 1-1-10 (solid), 1- 
1-2 (dotted), and 5-3-5 (dot-dash). The Curie temperature (Tc) for the dipole- 
dipole energy 75.0, 26.1, 6.8, and 0.35meV are also indicated. The dipole length 
is assumed to be 3.5 x 10“^nm as in GaAs. 



ferromagnetism 0. The Weiss constant A is evaluated using eqs. (EH) and (1^ 
to give the Curie temperatire. 



Tc = - 



Q_ 

3fc 



(2.3) 



Typical values of Tc, i.e. 290, 101, 26, and 1.3K are given in Fig.l for the dipole- 
dipole energies 75.0, 26.1, 6.8, and 0.35meV respectively. 



2.3 Ensemble Stability 

It is seen from Fig.l that the block having the minimum dimension in x-y plane 
has the largest dipole-dipole energy, ex. (1, 1, 10) block which is an one dimen- 
sional array with 10 quantum dots along the z direction. In this case, the parallel 



Spatiotemporal Dynamics 471 



(ferromagnetic type) phase is stable even at room temperature if the quantum 
dot spacing is less than 1.7nm, and it is stable at 77K and 4K if the spacing 
is below 2.6nm and 7.1nm respectively. In the (5, 3, 5) block having 5x3 two 
dimensional arry in x-y plane and 5 dots along the z direction, it is stable at 
77K and 4K if the spacing is less than Inm and 2.8nm respectively. 

As a matter of course, the stability increases if we employ materials with 
larger dipole moments; for example in the case of InSb of which dipole moment 
is approximately 7.14 times larger than that of GaAs 0, the dipole-dipole energy 
is larger by a factor of 51, resulting in the room temperature Tc at separation of 
6.2nm in (1, 1, 10) block, and at 2.5nm in (5, 3, 5) block. This might improve 
further if some organic materials with gigantic dipole moments are used. 

It is also obvious that in this block for the dipole-dipole interaction, the 
stability does not directly depend on the volume of the block or the number of 
dipoles, although that is the case in other kinds of memory devices j^. In our 
case, the more the side by side dipole arrangement, the less the total dipole- 
dipole energy gain, because a dipole moment produces a field in the antiparallel 
direction at the side by side positions. Therefore, the stability depends on the 
shape of the block rather than the volume of it. 



3 Immunity of the Quantum Logical Blocks to Bit and 
Phase Errors 



3.1 Bit Error Immunity 



There is a whole bunch of energy levels arising from all the quantum dots in the 
same block, including energetically degenerate states. Superposition (or intra- 
block entanglements) of states having same multiple number n of excited sites 
may be generated in a block having N quantum dots, by an appropriately pre- 
cision 7T pulse. 



This state 



n\{N — n)l 



Nl 



l,l---l,0,0,---) + 
| 0 , 1 , 1 --- 1 , 0 ,---) + | 0 , 0 , 1 , 1 --- 1 , 0 ,- 
^ |i) 



and the physical ground state 

| 0 , 0 , 0 , 



| 0 > 



(3.1) 

(3.2) 



should work as a dual basis (| 0), | 1)) for the computation. Thanks to the 
dipole-dipole interaction, it is obvious from the results of previous section m 
that these basis states are immune to thermal agitation providing the basic 
coherence for the quantum computation. First of all, this is powerful to prevent 
bit errors (amplitude errors) such as | 0 ) — > | 1 ) and | 1 ) ^ | 0 ), but this is also 
effective to avoid phase errors as explained in the next subsection 
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3.2 Phase Error Immunity 



It may be possible to generate another orthogonal basis (| 6), | 1) ) by an 
Hadamard transformation f | 





|i)) = 


|6) 


(3.3) 












i)) = 


» |i). 


(3.4) 



This transformation increases the resistance of the coding against phase er- 
rors, because a phase error in (| 0), | 1)) basis corresponds to a bit error in 
the (I 0), I 1)) basis, as is easily seen from the transformation in ens. (l3.3ll and 
(ft. 411 . for the case of phase error such as | 0) ^ | 0) and | 1) ^ — | 1), of which 
consideration may be sufficient for our purpose [Zj. 

There is also cancellation of phase fluctuations in the quantum dot ensem- 
ble. For a block with the environment, the phase fluctuations of an off-diagonal 
element of the density matrix in the (|0), |1)) basis or even in the (| 0), | 1)) 
basis, should cancel each other, as the number of excited site n increases. This 
saves the off-diagonal term which generates the entanglement. 

The situation is easily seen in the bosonic heat bath model, where the phase 
fluctuations of the excited state and the ground state both at site k may be 
represented as ipk and —tpk respectively, and their distributions are Gaussian 
having their means at zero 0. The total phase fluctuation may be expressed as 



— — + </>2 + • • • + 4'k-l) + {4>k + (f’k+l + • • • 

+ (l^k+n-l) — {(j^k+n + (j^k+n+l + ’ ' ’ + 4>n) 
— > 0 for n, N ^ oo, 



for a state i, leading to 

e^i 0,0---0,l,l---l,0,0,---), 

= I I U) | U+i) 

X . . . 6"*'^'“+'* I Ofc+„) 6"*'^*^+"+! I Ofc+„+i) • • 

_ g— (^1+02H h^fc-l) + (0fc+^fc + lH h^fc + n-l) 

-iH + r.+<l>k + ,^ + l + - + <PN) I 0, 0 • • • 0,1, 1 • • • 1,0, 0, • • •), 
^|0,0---0,1,1---1,0,0,---),. 



(3.5) 



(3.6) 



This may be considered as the ensemble cancellation of random phase fluctua- 
tion. 
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3.3 Error Prevention Methods 

There may be two methods to avoid the phase decoherence of encoded states 
during some significantly complex computation. 

One is to devide the total computational algorithm into shorter pieces, in 

such a way that each piece could be executed by a small block structure. If we 

denote the statistical variance of the temporal phase fluctuation in each identical 

quantum dot as tr^, then the ensemble average in ith block consisting of Ni 

2 

quantum dots gives variance fj- . Therefore the whole computing structure of M 
blocks may suffer from which corresponds to a standard deviation on 

the order of 

However, this deviation remains finite, so long as each piece of the algorithm 
(eventually M) is sufficiently small. The permissible value of ^ depends on the 
employed error correction scheme and the requied accuracy of computation. 

The other method is to execute the main part of the computation in the 
(I 0), I 1) ) basis given by eqs. (t3..3ll and l|3.4|) . In this basis, the phase error 
is suppressed directly by the dipole-dipole interaction among the ensemble of 
quantun dots in each block, as discussed in mi and 3221 As a matter of course, 
some gates or operations should be added to the input and output ports to 
practice the transformation of ens. (13.311 and (13.411 between the bases. 



4 Quantum Computing Systems 



It is possible to construct a quantum computing system out of the blocks given 
in previous section (©• As a simple example, a CCN gate is given in Fig. 2, 
which is an essential structure for universal quantum computing system. 



Each line of bit a, b, ••• consists of blocks ai, a, 2 , ■■■, bi, b2, •••.A 
CN operation Pcn is implemented by the couple of blocks facing each other 
(i.e. 03 and &3, 05 and 65) and a photonic tt pulse. The separation of quantum 
dots is designed to differ line by line, to distinguish individual line from each 
other photonically p. Rotational operations and are also 

implemented by the couples of blocks (i.e. 62 and C2, &4 and C4, and og and cg) 
and a photonic ^ pulse and a ^ pulse respectively. The phase difference between 

lines a, b and line c is adjusted by a pair of controlled phase shifters {S^f^ and 

4 )- 



The CCN operation of Fig. 2 may be described as 



PCCN = SlSX/^PcNabRt:^^PcNabR^^^^ 



Q 2 Q 2 

•^ba'^ab 



1 + ^(1 + 0 "a)(l + “ CTc) 



(4.1) 



(4.2) 
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n/2 It/2 

Sba Sab 




Fig. 2. A solid block CCN gate, combining two CN gates (Pat), two f controlled 
rotators and RZc^), one ^ controlled rotator and a pair of |- 

controlled phase shifters and Arrows with tt, and ^ denote the 
respective photonic pulses. 



with = I I , where cr^, 

VT* 1 ; ’ 

=p correspond to each other. 



cr^, cr^ are the Pauli operators, and ± and 



The non-Boolean superposed state such as ^ (|6) + |1)) or ^ (|6) + |1)) 
may be generated initially by a photonic ^ pulse bit by bit, and then conveyed 
from one block to the other, across different bit lines through the CN gate 
and the rotational gates, as well as propagated within the same bit line. After 
an appropriate time period, an expected entangled pure state (spontaneously 
generated superposition of different product states involving different bit lines) 
is attained, being specific to the arrangement of the blocks. 



The depth and/or width of the quantum dots may be designed to cause a 
small but sequential decrease of the energy gap, starting from the input port 
toward the output port, block by block along each bit line. Excitons carrying 
the dipole moment are transferred in one-way fashion through this biased path, 
bacause there exists some dissipative energy loss which deprives the excitons of 
their energy to climb back the line. Application of microwave pulses with proper 
energy may work as clock signals. These methods facilitate the directional quasi- 
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unitary evolution of the whole system, which is needed and sufficient for the 
quantum computation. 



5 Generation of the Quantum Entangled Pure States and 
Its Spontaneous Accumulation into the Output Port 

The spatiotemporal dynamics of the quantum entangled pure states is essential 
for the quantum super parallel computation. In the quantum CCN gate of Fig. 2, 
it is possible to define a basin for each quantum entanglement generated at a CN 
gate (P) or at a rotator (R), as indicated in Fig. 3 by dotted or dot-dash lines 
on the top view of the CCN gate, labeling by the times of generation ti, < 2 > • • •, 
and The basin may be thought of as the area that is going to be influenced 
by a gate starting each entanglement. 

All the necessary Boolean algebra are executed by inter-block operations, 
whereas the purely quantum mechanical superposition and interference of differ- 
ent states will take place in an intra-block manner (within each block) . Nonethe- 
less, the quantum interplay of different blocks will be spontaneously realized as 
the inter-block entanglement via Coulombic interaction, on the way of the uni- 
tary (or quasi-unitary) evolution. Then, blocks become incompatible each other, 
revealing the peculiarity of quantum mechanics. It is no longer possible to mea- 
sure the states of individual blocks simultaneously, without destroying the states 
as they are, until the whole thing settle down to yield the computational result 
in the output port. 

The resultant final state is designed to form a pattern of electron excitation 
with sharply localized peaks at particular blocks in the output port, as the 
results of the interference of wave functions, which may be achieved by rotational 
operations embeded in the algorithm such as quantum Fourier transfomations. 
Therefore this result should be read by some kind of photonic probing, involving 
some auxiliary higher energy states Q. 

6 Concluding Remarks 

A quantum CCN gate is proposed as the essential elements of the solid state 
quantum computing system, employing the logical blocks of three dimensional 
quantum dot array. The spatiotemporal dynamics of the quantum entangled 
pure states, which is essential for the execution of the quantum super-parallel 
computation, is illustrated for the proposed quantum CCN gate. 

The ferromagnetic type dipole-dipole interaction among the ensemble of in- 
duced dipole moments suppresses not only bit errors in the (| 0), | 1)) basis, 
but also phase errors in (| 0), | 1)) basis. Furthermore, the ensemble cancella- 
tion of the phase fluctuation in the block is also expected to improve the phase 
coherence for the quantum computation. 

It is needless to mention that significant technological development is ex- 
pected, for example precision fabrication of quantum dot array blocks either of 
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semiconductor or organic compounds, in order to implement the proposed quan- 
tum systems for practical computation. Focused photonic pulse technique should 
also be developed, for an example the scanning near-held method, for the input 
superposition and the rotational operations. 




Fig. 3. Spatiotemporal dynamics of quantum entangled pure states in the quan- 
tum CCN gate of quantum logical blocks. The basins of 5 entangled pure states 
generated at 5 different times ti, t 2 , ■ ■ ■ are indicated by the dotted or dot-dash 
lines.. 
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